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Preface 



The papers in this volume were selected for presentation at the Ninth 
Annual International Symposium on Algorithms and Computation 
(ISAAC’98), held on December 14-16, 1998 in Taejon, Korea. Pre- 
vious meetings were held in Tokyo (1990), Taipei (1991), Nagoya 
(1992), Hong Kong (1993), Beijing (1994), Cairns (1995), Osaka 
(1996), and Singapore (1997). 

The symposium was jointly sponsored by Korea Advanced In- 
stitute of Science and Technology (KAIST) and Korea Information 
Science Society (KISS) to commemorate its 25th anniversary in co- 
operation with Ministry of Information and Communication, Korea 
Information Society Development Institute, and Korea Science and 
Engineering Foundation. 

In response to the call for papers, 102 extended abstracts were 
submitted from 21 countries. Each submitted paper was reported on 
by at least four program committee members, with the assistance 
of referees, as indicated by the referee list found in these proceed- 
ings. There were many more acceptable papers than there was space 
available in the symposium schedule, and the program committee’s 
task was extremely difficult. The 47 papers selected for presentation 
had a total of 105 authors, resident as follows: Japan 24, Germany 
17, United State of America 15, Taiwan 10, Hong Kong and Korea 
6 each, Spain 5, Switzerland and Australia 4 each, Austria, Canada, 
and France 3 each, Italy and Netherlands 2 each, and Greece 1. 

We thank all program committee members and their referees 
for their excellent work, especially given the demanding time con- 
straints; they gave the symposium its distinctive character. We thank 
all who submitted papers for consideration; they all contributed to 
the high quality of the symposium. 

Finally, we thank all the people who worked hard to put in place 
the logistical arrangements of the symposium — our colleagues and 
our graduate students. It is their hard work that made the sympo- 
sium possible and enjoyable. 
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The Discrepancy Method* 

(Invited Presentation) 



Bernard Chazelle 

Princeton University, USA 
and 

Ecole Polytechniqne, France 



Abstract. Discrepancy theory is the study of irregularities of distri- 
butions. A typical question is: given a “complicated” distribution, hnd 
a “simple” one that approximates it well. As it turns ont, many qnes- 
tions in complexity theory can be reduced to problems of that type. This 
raises the possibility that the deep mathematical techniques of discrep- 
ancy theory might be of utility to theoretical computer scientists. As 
will be discussed in this talk this is, indeed, the case. We will give sev- 
eral examples of breakthroughs derived through the application of the 
“discrepancy method.” 



In 1935, van der Corput [21,22] asked how uniform an infinite sequence of 
numbers in [0, 1] can be. Specifically, how slowly does the function 



D{n) 



sup 

0<X<1 



\Sn n [0, x\\ — nx 



grow with n, where consists of the first n elements in the sequence? A sur- 
prising theorem of Schmidt says that D(n) can never be in o(logn). 

Schmidt’s result sets a limit on how well a certain discrete distribution, 
a; |<5'„ n [0, a;] I, can simulate a continuous one, x nx: thus, a certain amount 
of discrepancy between the two distributions is unavoidable. A computer scien- 
tist reading this paragraph should go back to the previous sentence and replace 
the words “discrete” by “easy to compute” and “continuous” by “hard to com- 
pute.” The resulting sentence talks about efficiently computable distributions 
simulating intractable ones, which happens to be the driving issue behind the 
use of randomization in computer science. 

A probabilistic algorithm, by its very nature, requires access to a sequence 
of perfectly random bits. For all sorts of theoretical reasons there is considerable 
interest in finding ways to reduce the amount of randomness, ie, to get by with 
pseudorandom sequences, or even better, to eliminate randomness altogether 
(derandomization) . 

This is where discrepancy theory comes into play. The theory is blessed 
with many powerful tools and techniques developed over the last century. The 

* This work was supported in part by NSF Grant CCR-93-01254, NSF Grant GCR- 
96-23768, ARO Grant DAAH04-96- 1-0181, and NEG Research Institute. 
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discrepancy method bridges these tools with complexity theory and algorithm 
design. It has been the force behind major recent developments in areas as 
diverse as probabilistic algorithms, derandomization, communication complex- 
ity, searching, machine learning, pseudo-randomness, computational geometry, 
optimization, and computer graphics. The talk briefly samples some of these 
achievements: in particular, optimal algorithms for convex hull and Voronoi di- 
agrams [2], lower bounds for arithmetic circuits [4] and range searching [3], de- 
terministic linear-time algorithms for linear programming with a constant num- 
ber of variables [6,8,15], as well as results in pseudorandomness [9,10,12,18,19], 
communication complexity [11], derandomization [13,17]. For references on dis- 
crepancy theory in general, see [1,14,16,7,20], and on the discrepancy method in 
particular, see [5]. 
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Implementing Algorithms and Data Structures: 
An Educational and Research Perspective 
(Invited Presentation) 



Roberto Tamassia 

Department of Computer Science, Brown University 
Providence, RI 02912-1910, USA 
rt@cs .brown.edu 

http : //www. cs .brown.edu/people/rt/ 



Abstract. Anecdotal evidence shows that developing correct and effi- 
cient implementations of fundamental data structures and algorithms is 
a challenging task. We discuss educational and research issues in algo- 
rithm engineering. Topics include algorithmic patterns, the development 
of a library of data structures and algorithms in Java, and the use of 
design patters in teaching introductory data structures courses. 



1 Introduction 

Developing correct, efficient, and reusable implementations of fundamental data 
structures and algorithms is a challenging task (see, e.g., [15]). The LEDA 
project [11] is major effort in this direction. The emerging research area of algo- 
rithm engineering investigates techniques for the implementation, testing, and 
evaluation of algorithms in realistic environments and scenarios. 

In this talk, we discuss algorithm engineering issues from an educational 
and research perspective. We focus on methods for implementing algorithms 
and data structures so that they are generic and extensible and on the use of 
software engineering concepts, such as object-oriented programming and design 
patterns. 

2 Algorithmic Patterns 

Abstraction has been, in the past years, a key technique for handling the increas- 
ing complexity of programs. Algorithmic abstraction is the process aimed at the 
implementation of sophisticated algorithms as reusable software objects. It goes 
beyond both procedural abstraction (structured programming) and data abstrac- 
tion (object-oriented programming), by viewing algorithms as objects that can 
be manipulated at the programming language level. It allows the programmer to 
construct new algorithms by specifying modifications and extensions of existing 
ones, thus reducing the time spent coding and debugging. It allows predefined 
algorithms to be combined in complex ways. These predefined algorithms of 
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common use, called algorithmic patterns, are the main tools for algorithm ab- 
straction. In many cases, algorithmic patterns abstract out the core of several 
similar algorithms, and the programmer only has to implement the extensions 
of the core that are needed for a specific algorithm. 

In [5] , we describe an object-oriented design and implementation of the core 
steps of the GIOTTO algorithm [13] for constructing orthogonal drawings of 
graphs. The design is based on a new algorithmic pattern called algorithmic 
reduction, which provides conversion among different types of data structures. 
GIOTTO is particularly suitable as a case study for the use of algorithmic reduc- 
tions because it consists of many steps where the original graph is converted into 
various other structures (e.g., a flow network). 

In [14], we present the design and implementation of a new algorithmic pat- 
tern, called binary space partition search, that provides a unifying description 
of two well-known algorithms for the fundamental geometric problem of point 
location in a planar map [12]. We also show how to separate the topological, 
geometric, and numerical aspects of the computation. Further algorithmic pat- 
terns within the area of geometric computing are being investigated, including 
plane-sweep, lifting map, and fractional cascading. 



3 JDSL: A Library of Data Structures in Java 

JDSL, the Data Structure Library in Java, is a new library of fundamental data 
structures and algorithms being developed at Brown and at Johns Hopkins. 

The architecture of JDSL is partitioned into four different components, 
namely the combinatorial, the topological, the geometric, and the arithmetic com- 
ponents. Each component of JDSL corresponds to one or more Java packages. 
Each package consists of a collection of interfaces, and for each interface one or 
more reference implementations are provided. The interfaces are arranged in hi- 
erarchies that may extend across different packages or components (see Fig. 1). 
The design principles of JDSL do not directly depend on Java; their validity 
extends to C-|— I- and other object-oriented languages. 

Several reference implementations provided by JDSL are optimized for fast 
running time. For example, the tiered vector implementation of a sequence [7] is 
considerably faster than the standard Java vector class. JDSL also allows rapid 
prototyping of complex algorithms and data structures by means of combinations 
of algorithmic patterns. This programming style may yield implementations that 
are slower than ad-hoc ones due to the indirection overhead. 

In [6] we present two key concepts used in JDSL: positions and locators. Many 
applications require data structures that allow efficient access to their internal 
organization and to their elements. This feature has been implemented in some 
libraries with iterators or items. JDSL refines the notion of an item and splits it 
into two related concepts: positions and locators. Positions are an abstraction of 
a pointer to a node or an index into an array; they provide direct access to the 
in-memory structure of the container. Locators add a level of indirection and 
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Fig. 1. Partial view of the JDSL interface hierarchy. 
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allow the user to find a specific element even if the position holding the element 
changes. 

A number of implementation case studies have been conducted based on 
JDSL, including data structures for planar maps [9], animations of geometric 
algorithms [2], and the aforementioned work on point location [14] and graph 
drawing [5] 

The JDSL library has a companion teach version [10] suited for instructional 
use. It includes a simplified combinatorial component and provides support for 
visualizers and testers [1]. The code examples in the data structures textbook by 
Goodrich and Tamassia [8] are taken from this library. Classroom experience at 
Brown and Johns Hopkins has shown that the methodology of the library allows 
students to complete sophisticated course projects in a short amount of time. 



4 Teaching Data Structure Design Patterns 

Design patterns [3] are organizational concepts developed for designing quality 
object-oriented software. A design pattern provides a framework for a “typical” 
software design problem. It is a general template that can be specialized for the 
specific problem at hand. Design patters have been shown to save development 
time and yield software that is robust and reusable. Design patterns are impor- 
tant, but probably neglected by most instructors in the introduction to data 
structures course (CS2) and usually not taught until the software engineering 
course. 

In [4,8], we show how to incorporate design patterns in the teaching of an 
introductory data structures course (CS2). The specific design patterns we dis- 
cuss include adapter, template method, comparator, decorator, iterator, position, 
and locator. Incorporating these patterns does not require any major revisions 
to the CS2 curriculum, for they fit in naturally with the discussions of several 
components of CS2 (see Table 1). 



Design Pattern 


CS2 Topic 


adapter 


stacks and queues 


template method 


tree and graph traversals 


comparator 


priority quenes 


decorator 


balanced trees, graphs 


iterator 


sequences, trees, graphs 


position 


sequences, binary trees, graphs 


locator 


priority queues, dictionaries 



Table 1. Some design patterns and natural places in the CS2 curriculum where 
they can be introduced. 
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Especially important is the template method pattern, where a class that im- 
plements the skeleton of an algorithm is provided, and the steps that will vary 
in different implementations are delegated to its subclasses. Template methods 
can be introduced in CS2 during the discussion of tree and graph traversals. 
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Abstract. In this paper we address the Loo Voronoi diagram of polygo- 
nal objects and present applications in VLSI layout and manufacturing. 
We show that in Loo the Voronoi diagram of segments consists only of 
straight line segments and is thus much simpler to compute than its Eu- 
clidean counterpart. Moreover, it has a natural interpretation. In appli- 
cations where Euclidean precision is not particularly important the Loo 
Voronoi diagram can provide a better alternative. Using the Loo Voronoi 
diagram of polygons we address the problem of calculating the critical 
area for shorts in a VLSI layout. The critical area computation is the 
main computational problem in VLSI yield prediction. 



1 Introduction 

The Voronoi diagram of polygonal objects has been given considerable attention 
because of its numerous applications in diverse areas such as biology, geography, 
robot motion planning, computer graphics. In this paper we present applica- 
tions of Voronoi diagrams in VLSI layout and manufacturing. It is well known 
that the ordinary Voronoi diagram of polygonal objects has linear combinatorial 
complexity and consists of straight line segments and parabolic arcs. Several 
efficient algorithms using divide and conquer, plane sweep, or incremental con- 
struction are known for its computation (see [2,3] for a survey). However, the 
existence of parabolic arcs in the diagram makes it hard to compute in practice. 
In a recent paper [9], a new complexity model for geometric algorithms was in- 
troduced, called the degree of an algorithm, which characterizes the complexity 
of the test computations of an algorithm. In the construction of the Voronoi 
diagram of segments Burnikel showed that the well-known incircle test can be 
solved with degree 40 [4,9]. This is very high for practical problems involving 
Voronoi diagrams. 

In a recent paper Aichholzer and Aurenhammer [I] introduced a new type 
of skeleton for polygonal objects in the plane called the straight skeleton. The 
straight skeleton consists of angular bisectors between edges and thus consists of 
straight line segments. The straight skeleton captures the shape of the defining 
elements in a natural manner. Its main advantage over the Voronoi diagram is 
the elimination of parabolic arcs. However, straight skeletons do not provide the 
proximity information that Voronoi diagrams do and thus, they do not always 
provide an alternative solution to Voronoi diagrams. 



K.-Y. Chwa and O. H. Ibarra (Eds.): ISAAC’98, LNCS 1533, pp. 9—19, 1998. 
(c) Springer-Verlag Berlin Heidelberg 1998 
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In applications where Euclidean accuracy is not particularly important a 
practical solution to the high degree problem of Euclidean Voronoi diagrams of 
segments may be the use of a different geometry, in particular, the Loo-metric. 
The Loo Voronoi diagram of polygons (or segments) consists of straight line seg- 
ments and its combinatorial complexity is similar to the Euclidean case. More- 
over, if the input vertices are on rational coordinates the Voronoi vertices are 
also rational. An intuitive way to view the Loo Voronoi diagram of polygonal 
shapes is to view it as the locus of points corresponding to centers of isothetic 
squares^ that touch the boundary in at least two points. In contrast, the Eu- 
clidean Voronoi diagram can be regarded as the locus of centers of circles that 
touch the boundary in at least two points. Note that in the rectilinear case where 
the edges of the polygons are either horizontal or vertical, the Loo Voronoi dia- 
gram coincides with the straight-skeleton of [I]. 

Our motivation for considering Loo Voronoi diagrams comes from applica- 
tions in VLSI layout and manufacturing, as described below. Note that layout 
shapes have many orthogonal edges but are not necessarily rectilinear. The prox- 
imity information preserved in the Loo Voronoi diagram can be used to calculate 
the critical area, a measure used in estimating the sensitivity of a VLSI layout 
to defects during manufacturing. The critical area calculation is the main com- 
putational problem in predicting the yield of a VLSI chip. Other applications 
of Voronoi diagrams in extracting from the physical description of a design the 
equivalent resistance are discussed in [12,16]. The Loo version would be as good 
in this case and much simpler to obtain [15]. Due to the simplicity of Loo Voronoi 
diagrams we expect them to also find applications in other areas. For example, 
in [17] an automatic generation for finite element meshes for multiply connected 
planar domains with polygonal boundaries is described. The Euclidean Voronoi 
diagram of the domain was used as a starting point and could be substituted by 
the Loo version [15] which is easier to compute. 

2 Loo Voronoi Diagrams 

The Loo distance between two points p = (xp,yp) and q = {xq,yq) is the 
maximum between the horizontal and the vertical distance between p and q 
i.e., d{p,q) = \VL&yi{dx{p,q),dy{p,q)} where dx(p,q) = \xp - Xq\ and dy{p,q) = 
\yp ~yq\- The Loo distance between a point p and a line I is d{p, 1) = min{d(p, q), 
Wq € /}. Intuitively, the Loo distance between two elements (points or lines) can 
be defined in terms of the smallest isothetic square touching the two elements. 
We shall refer to a line of slope (±1) simply as a 45-degree line, denoted 45° line. 

Consider a point p and the four 45° rays emanating away from p (see Fig. 1). 
They partition the plane into four quadrants. In each quadrant the Loo distance 
between p and any point q simplifies to either the vertical (g in upper and lower 
quadrant) or the horizontal distance (g in right and left quadrant) between p 
and g. Any non-45° line I that does not contain p intersects two 45° rays at 

^ An isothetic square is one whose sides are parallel to the coordinate axes. 
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d(p,q) q 

X • A 

d{p,q) 

Fig. 1. The 45° rays emanating from p 
partition the plane into four quadrants. 




Fig. 2. The Loo distance form p to I 
is dx{p,q). 



points q and r. Assuming that dx(j>,q) < dx{p,r), the Loo distance between p 
and I is clearly dx{p,q) (see Fig. 2). 

Lemma 1. If the Loo distance from a point p to a line I of slope ±s, s > 0 is r, 
the Euclidean distance from p to I is Sir where Si = . 

VS‘^ + 1 

The bisector of two elements (points or lines) is the locus of points equidistant 
from the two elements. The Loo bisector can be regarded as the locus of centers of 
squares touching the two elements. Figures 3 and 4 illustrate the bisector of two 
points and two lines respectively. In case of two points along the same horizontal 
or vertical line the bisector consists of a line segment and two unbounded regions 
(shaded regions in Fig. 3). Without creating any significant difference we will 
assign one region to one point and consider only the outermost boundary of the 
bisecting region as the bisector (thick lines in Fig. 3). The bisectors of two lines 
have slopes given by the following lemma. 




Lemma 2. The Lao bisector of two non-vertical lines h and I2 of slopes b\ 
and 62; respectively, consists of two lines as follows: 

~ If bi > b 2 > 0 the bisector consists of two lines of slopes —1 and • 

~ If bi < b 2 < 0 the slopes of the bisector are +1 and . 

— Ifbi >0 and 62 < 0 the bisector has slopes 67^1743 ■ 

Note that if one of these two lines is vertical (i.e., bi = ±00), the Lao bisector 
consists of a 45° line and a line of slope (1 + 262) for 62 > 0 (resp. (—1 + 262) 
for 62 < 0). Moreover if 62 = —bi the bisector is a vertical and a horizontal line 
and if bi = 00, 62 = 0 the bisector consists of two 45° lines. 
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Fig. 4. The Loo bisector of two lines. Fig. 5. The Loo bisector between a 

point and a line. 



The bisector of a point p and a line I {p ^ I, see Fig. 5) consists of at most 
four parts, one for each quadrant of p. Each part corresponds to the portion 
of the bisector between I and a vertical or horizontal line through p depending 
on the quadrant of p. Thus, each part is a line segment or ray whose slopes 
can be derived by Lemma 2. The unbounded parts of the bisector are always 
parallel 45° rays (see Fig. 5). For a point p along a non-orthogonal line I the 
bisector is a 45° line through p and for a point p along an orthogonal line I the 
bisector is the area enclosed by the 45° lines through p. For simplicity we only 
consider the boundary of this region as the bisector between p and 1. 

The Loo Voronoi diagram of a set of elements S = {ei, C 2 , . . . , e„}, de- 
noted V(iS'), is a partitioning of the plane into regions, called Voronoi cells, 
each of which is associated with an element, called the owner, of the cell. The 
boundary that borders two Voronoi cells is called a Voronoi edge, and adjacent 
Voronoi edges of each Voronoi cell are common to a Voronoi vertex. The col- 
lection of Voronoi edges and vertices is also called the Voronoi diagram. Fig. 6 
illustrates the L^o Voronoi diagram in the interior of a simple polygon (the 
medial axis) and Fig. 7 illustrates the Loo Voronoi diagram of polygons. 

Let C? be a planar straight line graph on n points in the plane as defined 
in [1,3] i.e., a set of non-crossing line segments spanned by these points. By the 
above discussion it is clear that the Loo Voronoi diagram of a planar straight- 
line graph G consists only of straight line edges. Moreover, if the vertices of G 
are on rational coordinates the Loo Voronoi vertices are also rational. Hence, 
the Loo Voronoi diagram is computationally much simpler than its Euclidean 
counterpart. Most of the existing algorithms to compute the Euclidean Voronoi 
diagram of segments can be easily modified to compute the Loo Voronoi di- 
agram in O(nlogn) time. The well known incircle test to determine whether 
an element is in or out of the circle defined by three other elements (lines or 
points) now simplifies to a test involving the square defined by the three ele- 
ments. (Note that three elements need not always define a square). In the worst 
case the Loo incircle test corresponds to 1) determining the intersection point / 
of two lines (bisectors), 2) determining the intersection point J of a line and 
a 45° line through /, and 3) comparing the horizontal or vertical distances be- 
tween I and J and between / and one of the defining elements. Note that no 
square root computations are involved. 
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Fig. 6. The Lao medial 
axis of a simple polygon. 



Fig. 7. The Loo Voronoi 
diagram of polygons. 



Fig. 8. The Loo Voronoi 
diagram of Gt ■ 



The combinatorial complexity of the Loo diagram is similar to the Euclidean 
one. However, there is some difference in the number of Voronoi vertices produced 
by reflex angles. The number of 45° rays emanating from a concave vertex v 
depends on the slopes of the incident edges and can be either one, two, or three. 
In the Euclidean case the normals bisecting a vertex from the incident edges are 
always two and in the straight- skeleton of [1] the corresponding angular bisector 
is always one. Thus the combinatorial complexity of the L^o diagram can be 
lower or higher than the Euclidean one depending on the input. An exact bound 
can be derived following the proof of [3] for the Euclidean case. 

Lemma 3. Let G be a planar straight line graph on n points in the plane such 
that G has t terminals ( vertices of degree 1 ), of which are incident to non- 

orthogonal edges, T 2 reflex angles inducing two 45° rays, and r^ reflex angles 
inducing three 45° rays. The number of (finite and infinite) vertices of the L^o 
Voronoi diagram of G is exactly 2n + t + + T 2 + 2ra — 2. 

Note that in the Euclidean case this bound is 2n + t + r — 2 where r is the 
total number of reflex vertices (i.e., r = ri +T 2 +ra for r\ reflex vertices inducing 
a single 45° ray) [3]. In the rectilinear case the bound becomes 2n + t — 2 as for 
the straight skeletons. 

3 A Sweep-Line Algorithm for the Voronoi Diagram 

In [5], Dehne and Klein described a plane sweep paradigm for the Voronoi di- 
agram of points in “nice metrics” . In this section we modify their algorithm to 
accommodate segments in the Loo metric. 

Let G be a planar straight-line graph on n points. We will compute the Loo 
Voronoi diagram of G, V(G). Consider a vertical sweep-line L sweeping across the 
entire plane from left to right. At any instant t of the sweeping process the sweep- 
line partitions the set of segments (edges) of G into three subsets Si,Sm and Sr, 
corresponding to those that lie totally to the left of L, intersect L, and lie totally 
to the right of L, respectively. The segments in Sm cut L into l^ml + 1 sweepline 
segments, two of which are unbounded, denoted by L(. Let St denote the portions 
of segments in Sm to the left of L. In other words, St = {s(t) | s S S'm} where s{t) 
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denotes the portion of s to the left of L. At every instant t of the sweeping 
process we compute the Voronoi diagram of Gt = SiU StU Lt- Note that at any 
instant t the sweep-line segments are treated as being part of G. (In case C? is a 
collection of simple polygons and we only want to compute the Voronoi diagram 
in the exterior of the polygons we exclude from Lt the segments in the interior 
of polygons, Fig. 8). 

The boundary of the Voronoi cell of each segment in Lt is referred to as an 
individual wavefront and the collection of individual wavefronts for all segments 
in Lt is called the wavefront. In Fig. 8, the wavefront is shown in dashed lines. 
Note that in the case of points the whole wavefront is a single individual wave- 
front. Clearly, in the Loo metric the wavefront is y-monotone. The Voronoi edges 
that have an endpoint common with the wavefront are called spike bisectors. 

As the plane sweep proceeds, the wavefront, and therefore the endpoints of 
spike bisectors, as well as the endpoints of segments in St move continuously 
to the right until an event takes place which causes the wavefront to change. 
Following [5], we have two kinds of events: 1) events corresponding to the occur- 
rence of a vertex or a vertical edge in G, referred to as site events and 2) events 
corresponding to the intersection point of two neighboring spike bisectors, re- 
ferred to as spike events. A site event induced by a point p takes place at t = Xp, 
where Xp is the abscissa of point p. A spike event C takes place at t = Xc + w 
where Xc is the abscissa of the intersection point and w is the distance of the 
intersection from the inducing element. 

Throughout the plane sweep we implicitly maintain the wavefront in a data 
structure referred to as the sweep-line status which is implemented as a height- 
balanced binary tree T. In particular, the sweep-line status contains the spike 
bisectors and the edges in St. To simulate the wavefront movement to the right 
we parameterize the endpoints of spike bisectors and segments in St as linear 
functions of t. At any instant t the wavefront is the polygonal line obtained by 
connecting the endpoints of the elements in T in increasing j/-coordinate value. 
The events are organized in a priority queue referred to as the event list in 
increasing priority value. Due to space limit we omit the details of the plane 
sweep algorithm. The number of site and spike events is 0{n) and thus the time 
complexity of the algorithm is O(nlogn). 



4 The Too Voronoi Diagram and the Critical Area 
Problem 

The yield of a VLSI chip is the percentage of functional chips among all chips 
manufactured. Predicting the yield of a chip is important due to the growing need 
to control the cost of manufacturing. In this section we describe an important 
problem in VLSI yield prediction that can be solved accurately and efficiently 
using the Lqo Voronoi diagram of a layout. 

Yield estimation models are based on the concept of critical area which is a 
measure reflecting the sensitivity of the layout to spot defects caused by particles 
such as dust and other contaminants in materials and equipment. The main 
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reason for yield loss during manufacturing are “extra material” defects causing 
shorts between different conducting regions. For information on yield estimation 
and spot defects see for example [6,7,8,10,lld4,18,19,20]. 

Given a circuit layout C, the critical area is defined as A{r)D{r)dr 

where A{r) denotes the area in which the center of a defect of radius r must fall 
in order to cause circuit failure and D{r) is the density function of the defect 
size. Defects are usually modeled as circles and the density function has been 
estimated to follow the distribution [7,14,18,20]. Existing methods require 

substantial CPU time for large layouts. They can be summarized into three 
categories: 1) Geometric methods, where A{r) is computed for several different 
values of r independently and then the results are used to approximate Ac- 
The methods to compute A{r) are usually based on shape- expansion followed 
by shape-intersection (see [20] for references). For rectilinear layouts there is a 
more efficient scan-line method [14]. 2) Monte Carlo simulation, where a large 
number of defects is drawn with their radii distributed according to D(r). The 
critical area is estimated by dividing the number of defects causing faults over 
the total number of defects [21]. 3) Grid-based approach, where an integer grid 
is assumed over the layout and the critical radius (i.e., the radius of the smallest 
defect causing a fault at this point) of every grid point is computed. 

For rectilinear layouts our solution to the critical area problem for shorts 
(assuming that defects are isothetic squares) appears in [13]. It was shown that 
the problem can be reduced to computing the 2nd order Lao Voronoi diagram 
of polygons. Consider an arbitrary point p in the layout. The critical radius of p 
for shorts is the radius of the smallest defect which if centered at p overlaps with 
two different polygons. Such a defect causes a short between the two different 
conducting regions represented by the polygons. Thus, the critical radius of p is 
the distance of p from its second nearest polygon. Given the Voronoi diagram of 
the polygons in the layout, we subdivide the Voronoi cell of a polygon P by the 
Voronoi diagram of its neighbors. In other words we obtain the 2nd order Voronoi 
diagram of polygons. A region of an element s (edge or reflex vertex) within 
the Voronoi cell of P is defined as regp{s) = {a; | d{s,x) < d{t,x),Vt G C — P}, 
where C is the given collection of polygons. The critical radius of every point 
X G regp{s) is determined by the distance of x from element s; we say that s is 
the owner of regp{s). 

Having the critical area computation done based on Euclidean Voronoi dia- 
grams would be out of the question in practice due to the difficulty of computing 
the Voronoi diagram of segments. However, the computation in Lac is practical. 
In [13], we showed that once the 2nd order Voronoi diagram is available, the 
critical area for shorts in rectilinear layouts can be done analytically. In partic- 
ular, we showed that the critical area formula can be written as a function of 
the edges of the 2nd order Voronoi diagram. In the following we generalize this 
result to general non-rectilinear layouts. 
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4.1 The Critical Area for Shorts as a Function of Voronoi Edges 

We have a layer in a circuit layout consisting of a collection of disjoint simple 
polygons C in arbitrary orientations. Our goal is to evaluate the integral Ac = 
Jg A(r)D(r)dr, where D(r) = and rg is a minimum optically resolvable 

size. Recall that A{r) denotes the area of the critical region for square defects of 
radius r. The critical region for radius r is the locus of points where if the center 
of a square defect of radius r is placed it causes a short i.e., the defect overlaps 
with two disjoint polygons. 

Let’s assume that we are given the 2nd order Loo Voronoi diagram of C. 
Fig. 7 shows the Loo Voronoi diagram of a set of shapes and Fig. 9 shows the 
2nd order subdivision within a bounded cell. Then Ac = ^c(V) for all (2nd 

order) Voronoi cells V where Ac(V) denotes the critical area within V. Note 
that Ac{V) = A{r, V)D{r)dr where A{r, V) denotes the area of the critical 
region for defect radius r within V . 




Fig. 9. The 2nd order subdivision Fig. 10. The decomposition of V into 
within a Voronoi cell. trapezoids. 



Let’s first concentrate on a single (2nd order) Voronoi cell V having as owner 
an edge e of slope ±s, s > 0. The Voronoi cell of a vertex is considered as the 
Voronoi cell of a vertical or horizontal edge. Consider a decomposition of V into 
trapezoids by drawing lines perpendicular to e (lines of slope ±l/s) emanating 
from the Voronoi vertices of V (see Fig. 10a). Each trapezoid T is further decom- 
posed into orthogonal triangles and at most one rectangle TZ by drawing lines 
parallel to e (slope ±s) through its vertices. (In case T is a slanted parallelo- 
gram continue the decomposition recursively) . We distinguish between two kinds 
of triangles, red and blue, depending on the relevant position of the hypotenuse 
and the orthogonal apex with respect to e. In particular, if the owner e and the 
orthogonal apex lie on opposite sides of the hypotenuse the triangle is colored 
red, otherwise it is colored blue. In Fig. 10b, the lightly shaded triangle is red 
and the darker shaded one is blue. 

Given two vertices Vj and Vk such that Vj is closer to e than Vk , let rj , Vk 
denote the corresponding critical defect radii i.e., the Lao distance of Vj and Vk 
from the line I through e respectively. We derive the following formulas for the 
critical area within an element of the above decomposition. 
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Lemma 4. The critical area within a rectangle TZ, a red triangle T^ed o,nd a blue 
triangle Tuue is given by the following formulas, using the “r^ / r^ ” defect density 
distribution: 

A,{Tred) = ^ {ST In (2) 

2 rj ru 

A,{Tuue) = '^{--ST\nC-^)) (3) 

2 r, rj 

where I is the size of the edge of TZ, Tred and Tuue parallel to e, rk,rj,rk > rj 
are the maximum and the minimum critical radius of their vertices, S = , 

where s is the absolute value of the slope of e, and T = where t is the 

absolute value of the slope of the hypotenuse (i.e, the slope of the corresponding 
Voronoi edge). For s = oo, 5 = 1 and T = t. 

We can derive the critical area within V by adding up the critical areas within 
every rectangle and triangle in the above decomposition of V . Because of the 
summation, terms of the form Sl/ri corresponding to internal decomposition 
edges cancel out. Similarly, for logarithmic terms involving endpoints of the 
decomposition other than Voronoi vertices. Thus, the critical area within V can 
be written as a function of Voronoi edges. Let’s color the Voronoi edges that 
induce the hypotenuse of red triangles as red and those inducing the hypotenuse 
of blue triangles as blue. Voronoi edges that are incident to a rectangle or induce 
an orthogonal edge of a triangle must be either parallel or perpendicular to 
the owner of the cell. Those Voronoi edges that are parallel to the owner are 
referred to as prime and they are portions of bisectors of two parallel edges. 
Prime Voronoi edges are colored red if the interior and the owner of the cell lie 
on opposite sides of the edge; otherwise they are colored blue. Those Voronoi 
edges that are perpendicular to the owner are colored neutral. In the formulas of 
Lemma 4, terms corresponding to red edges get added while terms corresponding 
to blue edges get subtracted. Neutral edges do not contribute in the formulas. 
The boundary of the layout is assumed to be a rectangle whose intersection points 
with the Voronoi diagram are treated as Voronoi vertices. Boundary edges are 
treated as Voronoi edges but they contribute at most once in the formulas. 

We derive the following theorem, which is given without a proof due to space 
limitation. 



Theorem 1. Given the 2nd order Lao Voronoi diagram of polygons of a layer 
in a circuit layout C and assuming that defects are squares following the 'Vq /r^ ” 
defect density distribution, the critical area for shorts in that layer is given by 
the following formula: 



S,h 



A. = rli E E 
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where li and ri denote the length and the critical radius of a prime Voronoi 
edge e,, and rk,rj,rk > rj denote the maximum and the minimum critical radius 
of a non-prime Voronoi edge Cj. The factors Se and are as defined in Lemma 4 
for each owner e of the non-prime edge Cj. B is the sum of the blue terms for 
prime boundary edges and appears as a correction factor since boundary edges 
contribute at most once. 
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Abstract. Given a terrain defined as a piecewise-linear function with n 
triangles, and m point sites on it, we would like to identify the location 
on the terrain that minimizes the maximum distance to the sites. The 
distance is measured as the length of the Euclidean shortest path along 
the terrain. To simplify the problem somewhat, we extend the terrain to 
(the surface of) a polyhedron. To compute the optimum placement, we 
compute the furthest-site Voronoi diagram of the sites on the polyhedron. 
The diagram has maximum combinatorial complexity 0{mn^), and the 
algorithm runs in 0{mn^ log^ m(logm -|- logn)) time. 



1 Introduction 

Problem statement. A (polyhedral) terrain is the graph of a piecewise-linear 
function defined over a simply-connected subset of the plane. It can be repre- 
sented by a planar triangiilation where each vertex has an associated elevation. 
The elevation of any point in the interior of an edge (triangle) is obtained by 
linear interpolation over the two (three) vertices of the edge (resp. triangle). The 
polyhedral terrain is commonly used as a model for (mountainous) landscapes. 

This paper addresses the facility location problem for a set of sites on a 
terrain. More precisely, assume that a set of m point sites on a terrain defined 
by n triangles is given. The distance between two points on the terrain is the 
minimum length of any path between those points that lies on the terrain. The 
facility center of the sites is the point on the terrain that minimizes the maximum 
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distance to a site. We assume throughout that m < n. To avoid complications 
involving the boundary of the terrain, we extend the terrain to (the surface 
of) a polyhedron. We allow only polyhedra homeomorphic to a ball, so that 
their surfaces are homeomorphic to a sphere. We assume that all faces of the 
polyhedron have been triangulated. This increases the combinatorial complexity 
of the polyhedron by at most a constant factor. 

Previous results. In the Euclidean plane, the facility center, or the center 
of the smallest enclosing disc of a set of m point sites, can be determined in 
0{m) time. Several algorithms attain this bound. Megiddo [4] gave the first 
deterministic linear-time algorithm, and a much simpler, linear expected time 
algorithm was found by Welzl [10]. 

In his master’s thesis, van Trigt [9] gave an algorithm that solves the fa- 
cility location problem on a polyhedral terrain in 0{m^n^logn) time, using 
+ n)) space. 

There is a close connection between the facility center and the furthest-site 
Voronoi diagram of the sites. Namely, the facility center must lie at a vertex or 
on an edge of this diagram. In the plane, with Euclidean distance, the furthest- 
site Voronoi diagram has cells only for the sites on the convex hull of the set of 
sites, and all cells are unbounded. 

It appears that on a polyhedron, some of the standard properties of furthest- 
site Voronoi diagrams in the plane no longer hold. For instance, a bisector on the 
polyhedron is generically a closed curve consisting of as many as 0{n^) straight- 
line segments and/or hyperbolic arcs, in the worst case. In general, it may also 
contain two-dimensional portions of the surface of the polyhedron. 

Mount [6] showed that the nearest-neighbor Voronoi diagram of m sites on 
(the surface of) a polyhedron with n faces with m < n has complexity 0{n^) 
in the worst case; he also gave an algorithm that computes the diagram in 
0{n^ logn) time. We do not know of any previous work on furthest-site Voronoi 
diagrams on a polyhedron. 

The problem of computing the shortest path between two points along the 
surface of a polyhedron has received considerable attention; see the papers by 
Sharir and Schorr [8], Mitchell, Mount and Papadimitriou [5], and Chen and 
Han [1]. The best known algorithms [1,5] compute the shortest path between 
two given points, the source s and destination t, in roughly 0(ri^) time. In fact, 
these algorithms compute a data structure that allows us to compute the shortest 
path distance between the source s to any query point p in O(logn) time. 

New results and methods. This paper gives an 0{mn^ log^ m(logm-l-logn)) 
time algorithm to compute the furthest-site Voronoi diagram and find the facility 
center for a set S of m sites on the surface of a polyhedron with n faces. Given 
the linear-time algorithm for finding the facility center in the plane, this bound 
may seem disappointing. However, the algorithm for computing the furthest-site 
Voronoi diagram is near-optimal, as the maximum combinatorial complexity of 
the diagram is 0{mn^). 

2 Geometric Preliminaries 

Previous papers on shortest paths on polyhedra [8, 5, 1,9] use a number of impor- 
tant concepts that we’ll need as well. We review them briefly after giving the 
relevant definitions. 
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In the remainder of the paper P is (the surface of) a polyhedron. As stated 
before, we allow only polyhedra homeomorphic to a ball, so that their surfaces 
are homeomorphic to a sphere. Let 5 be a set of m point sites on P. Consider 
first a single site s G P. For any point p on P we consider a shortest path 
from p to s; note that in general it is not unique. Such a shortest path has a 
number of properties. First, if it crosses an edge of P properly, then the principle 
of reflection holds. This means that if the two incident triangles were pivoted 
about their common edge to become co-planar, then the shortest path would 
cross the edge as a straight-line segment. This principle is called unfolding. For 
any vertex on the polyhedron, we define its total angle as the sum of the angles 
at that vertex in each of the triangles incident to it. The shortest path cannot 
contain any vertex for which the total angle is less than 27 t, except possibly at 
the source p and the target s. 

Any shortest path crosses a sequence of triangles, edges, and possibly, ver- 
tices. If two shortest paths on the polyhedron cross the same sequence (in the 
same order), we say that these paths have the same edge sequence. If a shortest 
path from p to s contains a vertex of the polyhedron, the vertex reached first 
from p is called the pseudoroot of p. If the path does not contain any vertex, 
then site s is called the pseudoroot of p. 

The shortest path map (SPM) of s is defined as the subdivision of P into 
connected regions where the shortest path to s is unique and has a fixed edge 
sequence. For non-degenerate placements of s, the closures of the regions cover P, 
so the portion of P outside any region, where more than one shortest path to s 
exists are one-dimensional. It is known that the shortest path map of a site has 
complexity O(n^); this bound is tight in the worst case. The SPM restricted to a 
triangle is actually the Euclidean Voronoi diagram for a set of sites with additive 
weights. The sites are obtained from the pseudoroots of points on the triangle. 
The coordinates of the diagram sites are obtained by unfolding the triangles in 
the edge sequence to the pseudoroot so that they are all co-planar. The weight 
of a pseudoroot is the distance from the pseudoroot to the site s. It follows that 
the boundaries of regions in the SPM within a triangle consist of straight-line 
segments and/or hyperbolic arcs. For any point on a hyperbolic arc or a segment 
there are two shortest paths to s with different pseudoroots. 

Given two sites s and t on the polyhedron, the bisector /3(s, t) is the set of 
all points on the polyhedron whose shortest path to s has length equal to the 
shortest path to t. The bisector consists of straight-line segments, hyperbolic 
arcs, and may even contain two-dimensional regions. Such regions occur only 
when two sites have exactly the same distance to some vertex of P. For simplicity, 
we assume that these degeneracies don’t occur. 

Fix an integer fc, l<fc<m— 1. For a point p G P, let S{p,k) C S he 
any set of k sites in S for which the shortest path to p is not longer than the 
path from p to any site in S' \ S{p, k). The order-k Voronoi diagram of a set S 
of m sites on P is a planar graph embedded in P that subdivides P into open 
regions such that for any point p in a region, S{p,k) is unique, and such that 
for any two points p and q in the same region, S{p,k) = S{q,k). The interior 
of the boundary between two adjacent regions is an edge of the order-fc Voronoi 
diagram; it is easy to see that each edge lies on a bisector of two sites in S. The 
non-empty intersections of the closures of three or more regions of the order-fc 
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Voronoi diagram are its vertices. We assume that all vertices have degree three; 
otherwise, a degeneracy is present. 

The order- 1 Voronoi diagram is known as the (standard) Voronoi diagram 
or closest-site Voronoi diagram, and the order- (m — 1) Voronoi diagram is also 
called the furthest-site Voronoi diagram. In this paper, we only deal with the 
closest and furthest site Voronoi diagrams, and we give a new algorithm for 
computing the furthest-site Voronoi diagram of a set S of sites on a polyhedron. 
We denote it by FVD(S'), and refer to it more loosely as the diagram. For the 
closest- (furthest-) site diagrams of S, the region 7^(s) of a site s G S' is the locus 
of all points that are closer to (further from) s than to (from) all other points 
in S. 

The following facts are crucial for the algorithm below to work and for the 
analysis to hold. Lemmas 1, 2, and 3 are similar to the lemmas in Leven and 
Sharir [3]; they are general statements about a large class of metrics and hold 
under very general conditions. 

Lemma 1. In the closest-site Voronoi diagram of a set S of sites on P, the 
region TZ{s) of a site s G S is path- connected. 

Lemma 2. Bisector (3{s,f) is connected, and is homeomorphic to a circle. 

Lemma 3. For any three distinct sites s,t,u, bisectors P{s,t), P{s,u) intersect 
at most twice. 

Any family of simple closed curves (in this case, on a topological sphere) every 
pair of which crosses at most twice is called a family of pseudocircles. Thus for 
every fixed s G S, bisectors {(3{s,t) : t ^ s} form a family of pseudocircles. 

Lemma 4. Bisector (3{s,t) consists of 0{n^) straight-line segments and hyper- 
bolic arcs. 

Proof. The Voronoi diagram of {s, t} can be constructed by techniques similar to 
those used by Mitchell et al. [5] for the computation of the shortest path map of 
a single site. By adapting their analysis to the case with two sites, one can show 
that on each triangle of the polyhedron, (3{s,t) consists of 0(n) elementary arcs 
(straight-line segments and hyperbolic arcs) . Summing this over all the triangles 
gives 0{n^). See the paper by Mount [6]. 0 

Since the edges of the closest- and furthest-site Voronoi diagram lie on the 
bisectors of two sites in S, each edge also consists of 0{n^) straight-line segments 
and hyperbolic arcs. To simplify our exposition, the intersections between two 
adjacent segments or arcs on the edges are referred to as breakpoints, as opposed 
to the vertices of the diagram that we defined before. Note that we consider the 
point where a bisector crosses an edge of P to be a breakpoint. 

Lemma 5. The furthest- site Voronoi diagram FVD(S) of a set S of m sites on 
a polyhedron has 0{m) cells, vertices, and edges. 

Proof. Let TZs>t be the region of points that are further from s than from t, for 
s,t G S. In this notation TZ{s) = Cites t^s'^s>t- From Lemma 3 it follows that 
this intersection is the intersection of a set of pseudo-disks. It follows that for 
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each s € S, the region TZ{s) in FVD(S') is connected. So we have at most one 
cell (region) for each site in S, and, by Euler’s relation for planar graphs, the 
number of vertices and edges of FVD(5') is also 0 {m). 0 

We define the total complexity of FVD(S') to be the sum of the number of 
vertices and breakpoints in FVD(S'). 

Lemma 6. The maximum total complexity of FVD(S) is 0 (rnn?). 

Proof. Each edge of FVD(S') is part of some bisector ( 3 {s, t) for two sites s,t G S. 
Consequently, the upper bound follows from Lemmas 5 and 4. 

As for the lower bound, we describe a construction that shows that FVD(S') 
for a set S oi m point sites on a non-convex polyhedron P with n edges can have 
complexity The construction will focus on proving an l7(mn)-bound 

for a single edge of P. It is described for point sites in the plane with obstacles. 
This can then be “lifted” to a non-convex polyhedron. 

First we will describe the location of the sites, then the obstacles. Assume 
that [S'! is even; we split S into and S2 with k = mj 2 points each. Figure 1 
shows the configuration of the sites Si = {si, . . . , Sfc} (in the figure, k = 5 ). For 
ease of description, we also specify two additional points sq and s^+i; these are 
not sites. The sites si, . . . , Sfc G 5i and the points sq and s^+i are placed equally 
spaced on the lower semi-circle of a circle Ci. For 1 < i < fc -|- 1, let bi-i be the 
point where the bisector /3(si_i, Si) meets the upper semi-circle of Ci. Note that 
any point on the arc of the upper semi-circle Ci between and bi is further 
away from Si than from any other site in Si . Let 7^ denote the cone originating 
at site Si that is bounded by the rays ray(si,6i_i) and ray(si,6i). The portion 
of the cone 7^ that lies outside Ci is further away from Si than from any other 
site in ^i. Figure 1 shows just the cones 72, 73 and 74. 

Let £ be a horizontal line lying some distance above the circle Ci. The second 
set of sites S2 = {sj,... ,sj.} is obtained by reflecting the set through i. 
That is, s' is the image of Si from reflecting in i. The points in S2 lie on a 
circle which is the reflection of Ci. The cone 7' is defined analogously and is 
the reflection of 7^. Let ii be the intersection of cone 7^ and i. Note that ii is 
also the intersection of 7' and £. 

We have specified the point sites. Now we will specify the location of the 
obstacles. The important fact is that the cones 7i, . . . , 7fc have a common inter- 
section around the center of circle Ci. Let C2 be a small circle lying within this 
common intersection, and let the segment ab be the horizontal diameter of C2. 
Figure 1 (detail) shows the circle C2 and the segment ab. Let a'b' be the reflec- 
tion of ab through £. Our obstacle set will be the segments ab and a'b' minus a 
few point holes (through which a path can pass). The segment ab has an evenly 
spaced set hi, . . . ,hn of point holes. The segment a'b' also has an evenly spaced 
set h'l, . . . , of point holes; the only difference is that these holes are slightly 
shifted to the left. 

We specified all the points and obstacles. Now, we will argue that the line £ 
is intersected by k = mj^ edges of FVD(5'), each of which crosses £ Tl(n) 
times. Let us focus on the portion £i of the line £. Since any point in £i is 
further away from Si (resp. s') then from any other site in (resp. S2), Si 
and s' are the only interesting sites for £i. We will now argue that /3(si,s') 
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Fig. 1. The configuration of Si and the obstacles in C 2 (detail). 



crosses £ f7(n) times. For 1 < j < n, let pij (resp. p[ j) be the point of intersec- 
tion of the line through Si (resp. s') and hj (resp. /i' ) and the line i. Because of 
the horizontal shift of the holes in a'b', the points occur interleaved on £i as the 
sequence p' 2 ,^, 2 , This is illustrated in Figure 2 for £ 2 - 

For 1 < j < n, since Si can “see” pij whereas s' cannot, there is a neighborhood 
around pij that is closer to Si than to s'. By symmetric reasoning, there is a 
neighborhood around p' ^ that is closer to s' than to Si . It follows that the bisec- 
tor /3(si, s') must cross £i between p' ^ and pij, and also between pij and p' 
Thus, /3{si,s[) crosses £i D{n) times, as illustrated in Figure 2. 




/3(s2,S2) 
£2 



Fig. 2. Detail of /3(s2, s^ . 



One gets Q{kn) = Q{mn) crossings for line £, Q{n) for each £i. The pattern 
can be repeated on n lines parallel to £ and sufficiently close to £. This gives 
Q{rnn) crossings for each of the n lines. The sites and the obstacles can be 
perturbed to a general position without affecting the lower bound complexity. 
By treating the lines as edges on a polyhedron, and ‘raising vertical cylinders’ 
with the obstacles as bases, we can get the Q{mn?) bound for a polyhedron. □ 
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Using standard arguments, and the fact that FVD(S') has maximum total 
complexity 0(rnn?), we obtain the following. 

Corollary 1. Given FVD{S), the facility center of S can be computed in 
0{mn^) time. 

3 Computing the Furthest-Site Voronoi Diagram 

In this section, we describe our algorithm for computing the furthest-site Voronoi 
diagram of the given set S of sites on the polyhedron P. Our algorithm uses ideas 
from the algorithm of Ramos [7] for computing the intersection of unit spheres 
in three dimensions. We first give an outline of the algorithm, and get into the 
details in the subsequent subsections. 

The algorithm for computing FVD(S') works as follows: 

— As a preprocessing step, compute the shortest path map for every site in S. 

— Subdivide S into two subsets R (the “red” sites) and B (the “blue” sites) of 
about equal size. 

— Recursively compute FVD(i?) and FVD(R). 

— Merge FVD(i?) and FVD(R) into FVD(RU R) = FVD(5) as follows: 

• Determine the set of sites Rq C R that have a non-empty region in 
FVD(R), i.e., FVD(i?) = FVD(i?o)- Observe that the remaining sites in 
R\Ro don’t influence the final diagram. Similarly, compute Bq C B. 

• Determine an low-degree independent set M G Rq, which is a subset with 
the property that the region of a site s G M has at most 10 neighbors 
in FVD(i?o)j and no two sites s,s' G M are neighbors in FVD(i?o). 
(Two sites are said to be neighbours if their regions share an edge of the 
diagram.) Compute Ri = Rq\ M and FVD(i?i), and repeat this step 
to generate a Dobkin-Kirkpatrick hierarchy [2] Rq G Ri G ■■■ G Rk and 
their furthest-site Voronoi diagrams, such that Rk has only a constant 
number of sites. Do the same for the blue sites to achieve Bq G Bi G 
■ ■ ■ G Bi and their furthest-site Voronoi diagrams. 

• Compute FVD(i?i U Bi) for 0 < i < k, exploiting the fact that Bi has 
only a constant number of sites. Similarly, compute FVD(i?fe U Bj) for 
0<J<1- This is the basic merge step. 

• Compute FVD(i?iURj) from FVD(i?iURj-i-i) FVD(i?i-|_iURj). This 
is the generic merge step, which when repeated gives FVD(i?o U Bq) = 
FVD(S'). 



3.1 Constructing the Hierarchy for Rq and Bq. 

We describe how to compute the hierarchy Rq G Ri G ... G Rk and their 
furthest-site Voronoi diagrams; for Bq, this is done analogously. The computation 
is similar to the Dobkin-Kirkpatrick hierarchy [2]. Using the fact that furthest- 
site Voronoi diagrams and their dual graphs are planar graphs, we can show that 
there is a low-degree independent set M G Rq such that M contains a constant 
fraction of the sites of Rq. In fact, such an independent set can be found in 
O(|i?o|) time using a greedy algorithm [2]. 



26 



Boris Aronov et al. 



As for the computation of FVD(i?i) = FVD(i?o \ M), we remove the sites 
in M one at a time from Rq and update the diagram after the removal of each 
site. Let s be such a site in M, and let p be a point that lies in the region in 
FVD(i?o) of s. After updating the diagram, p must lie in the region of a site s' 
that is a neighbour of s in FVD(i?o). So the region of s is divided among its 
neighbors, of which there are only a constant number, and all diagram edges 
in that region lie on the bisectors of those neighbors. Computing the bisector 
for two sites takes 0(n^ log n) using the techniques from Mitchell, Mount and 
Papadimitriou[5], and computing the bisector for every pair of neighbors of s 
takes asymptotically the same time. Given these bisectors, we can easily trace 
the edges of the diagram in the region of s in 0(n^ log n) time, using the vertices 
of that region as starting points. 

After all the sites in M have been removed from Rq and FVD(i?i) has been 
constructed, we repeat this procedure to create FVD(i? 2 ),--- ,FVD(i?fc). The 
total number of diagrams we construct this way is O(logm). 

By lemma 6, the size of FVD(i?i) = 0{\Ri\n'^). 

Since Yli=o 1-^*1 ^ geometric series, the total time for computing all inde- 

pendent sets is 0{m). The computation of the bisectors of neighbors and the 
reconstruction of the diagram after the removal of a single site from a diagram 
takes 0{n^ logn) time, and the total number of sites removed is less than m. It 
follows that the construction of the the hierarchy Rq D Ri D . . . D Rk and their 
furthest-site Voronoi diagrams takes 0{mn? log n) time in total. The total size 
of all diagrams constructed is 0{mn?). 



3.2 The Generic Merge Step 

The generic merge step is the computation of FVD(i?iUi3j) from FVD(i?iUBj+i) 
and FVD(i?i+i U Bj), which when repeated gives the required FVD(i?o U Bq) = 
FVD(S'). First some terminology: we call the sites in Ri+i the old red sites, 
and the sites in Ri \ Ri+i the new red sites. Similarly, the sites in Sj+i are the 
old blue sites, and the sites in Bj \ Bj+i are the new blue sites. Now consider 
any vertex v of FVD(i?i ^ Bj). The important fact is that not all three Voronoi 
regions incident to that vertex correspond to new sites; there must be at least 
one old red or blue site whose face is incident to v, because new red (blue) regions 
form an independent set in FVD(i?i) (resp. FVD(Bj)). So to determine all the 
vertices of FVD(i?, U Bj), it suffices to compute the regions in FVD(i?i U Bj) of 
all old red and blue sites. 

Consider an old red site r. The region of r in FVD(i?i U Sj+i) contains all 
points that are further from r than from any other site in Ri U and the 

region of r in FVD(i?i_|_i U Bj) contains all points that are further from r than 
from any other site in Ri+i U Bj. The region of r in FVD(i?i U Bj) is therefore 
the intersection of its regions in FVD(i?i U Bj+i) and FVD(i?i+i U Bj). We 
can compute this intersection for each face of the polyhedron separately by a 
line-sweep of the regions of r in FVD(i?i U Sj+i) and FVD(i?i+i U Bj). The 
time needed for computing the vertices of FVD(i?i U Bj) is therefore bounded 
by O(ClogC'), where C = max(n^|i?i U Sj+i |, U Bj |, U Sj |), which 

in turn equals n^{\Ri\ + \Bj\). Hence, computing the vertices of FVD(i?i U Bj) 
takes 0{n^{\R^\ + |Bj|) log(n^(|i?i| -I- \Bj\))) = O {log n + log m){\Ri\ + \Bj\)). 



Facility Location on Terrains 



27 



The edges of FVD(i?i U Bj) are either edges incident to the faces of old red 
or blue sites (which we already computed), or edges between the faces of two 
new sites of the same color (these edges are sub-edges of edges in FVD(i?i) or 
FVD(i?j), and can easily be found), or they are edges between the faces of a 
new red and a new blue site. For the latter category of edges we already have 
the incident vertices computed, and we can trace the edges after computing the 
bisector of the new red and new blue site. The total number of bisectors we have 
to compute and trace is bounded by |i?iUSj|, so this takes 0{n^ logn(|i?i|-|-|i3j|)) 
time. We conclude that computing FVD(i?i U Bj) from FVD(i?i U Bj+i) and 
FVD(i?i-|_i U Bj) takes 0(n^(logn -I- logm)(|i?i| -|- \Bj\)) time. 

Summing this over all 0 < i < fc, 0 < j < ^ gives 

k i 

0{n^{logn + logm)'^'^{\R,\ + \B^\)) (1) 

i=0 j—0 



We have 



k I k 

t.t.\ba=o(y: \Bq\) = 0{k\Bo\) = 0{mlogm), (2) 

j—Q i=0 

and similarly (1-^*1) O(mlogm). It follows that the total time spent 

in all the iterations of the generic merge step is 0{mn? logm{logm + logn)). 

3.3 The Basic Merge Step 

In the basic merge step, we compute FVD(i?i U Bi) for 0 < i < k, and 
FVD(i?fc U Bj) for 0 < j < I- We will exploit the fact that Bi and Rk con- 
tain only a constant number of sites. We will only describe the computation of 
FVD(i?i U Bi) for a fixed z; all other diagrams are computed similarly. 

1. For each site r G Ri and b G Bi, we compute the region of r in FVD({r, b}). 
To do this, we compute the closest-site Voronoi diagram for sites r and b 
using the 0{n? log n) algorithm of Mount [6]. The region of r in FVD({r, b}) 
is clearly the region of b in the closest-site diagram. The total time for all 
pairs r and b is 0{\Ri\ri^ logn), since there are only 0(|i?i|) pairs. 

2. Next, we compute the region of each site r G Ri in FVD(r U Bi); to do 
this, we intersect the regions of r in FVD({r, 6}) over all b G Bi. Since Bi 
has only a constant number of sites, the intersection can be accomplished 
in 0(n^ logn) time for a single red site r G Rihy iterating the intersection 
procedure in the generic merge step. The time taken for all the sites in Ri 
is 0{\Ri\n'^logn). 

3. Next, we compute the region of each site r G Ri in FVD(i?i U Bi) by inter- 
secting its regions in FVD(rUi?;) and FVD(i?i). The intersection procedure 
is similar to the one in the generic merge step, and it can be shown that its 
running time for all sites in Ri is 0(|i?i|n^ logn). 

4. To complete the computation of FVD(i?i U Bi), it remains to compute the 
regions of the blue sites. Note that all the vertices of FVD(i?iU.B;) are known 
at this stage; these are either vertices on the boundary of the red regions, or 
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they are the vertices of FVD(i?;) that have ‘survived’ after the computation 
of the red sites. The edges of FVD(i?i U Bi) are now traced out just as in 
the generic merge step; in fact, the situation is much simpler here. The time 
taken in this step is 0{\Ri\v? logn). 

Putting everything together, the time complexity of computing FVD(i?iUSj) 
is 0{\Ri\n^ logn). Hence, the time needed for computing all the diagrams in the 
basic merge step is 0{mn'^ logn). 



3.4 Total Running Time and Memory Requirements 

The time for merging FVD(i?) and FVD(R) into FVD(i? U R) is dominated by 
the generic merge step, which requires 0(mn^ log m (log m + logn)) time; the 
total running time satisfies the recurrence 

T(m) = T([to/2J ) + T(|'to/2] ) + Oijnn^ \ogm{\ogm + logn)) (3) 

which solves to T(m) = 0{mn^ log^ ni(logm + logn)). 

The memory requirements of the algorithm are linear in the size of all dia- 
grams that are constructed in the process, which is 0{mn^ logm). 

4 Discussion and Conclusions 

We have shown that the furthest-site Voronoi diagram of a set 5 of m sites on 
the surface of a polyhedron P has complexity 0{mn^), and we have given an 
algorithm for computing the diagram in 0{mn^ log'^ 7n(log m-|-log n)) time. Once 
the diagram has been computed, the facility center, which is the point on P that 
minimizes the maximum distance to a site in S, can be found in 0{mn^) time 
by traversing the edges of the diagram. 

The merge step in our divide-and-conquer approach for the computation of 
FVD(S') is quite complicated, and it would be pleasant to find a simpler method. 
Merging the recursively computed diagrams by sweeping seems natural, but the 
number of intersections of edges of both diagrams can be superlinear (in m), 
while only a linear number of them can end up as a vertex of the resulting 
diagram. 

It would be a challenge to find an output-sensitive algorithm, i.e., an algo- 
rithm that takes time proportional to the number edges/ vertices in the diagram 
plus the number of their intersections with the edges of P. Even more ambi- 
tious would be the computation of the diagram without explicitly representing 
all intersections of Voronoi edges and edges of the polyhedron. 

Another interesting issue is approximation: find (in o(mn^) time) a point 
with the property that the distance to the furthest site is at most (1 -|- e) times 
the radius of the smallest enclosing circle. 

Finally, it is worth investigating if the facility location problem can be solved 
without constructing the furthest-site Voronoi diagram. Recall that the facility 
location problem in the plane can be solved using techniques related to fixed- 
dimensional linear programming [4,10]. 
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1 Introduction 

Given a set B of obstacles and a set S of source points in the plane, the problem 
of finding a set of points subject to a certain objective function with respect to 
B and S' is a basic problem in applications such as facility location problem [5]. 

Suppose that each source point s € S has a positive weights w{s). The median 
problem is to find a point minimizing the following objective function 



where d{s,t) is the length of an obstacle-avoiding shortest path between two 
points s and t. A point t minimizing objective function (1), (2) is called a median, 
center point of S with respect to B, respectively. The set of all median points is 
called the median set of S and the set of all center points the center set of S. 
See Figure 1. In the following, we assume that the distance metric is rectilinear. 

In the absence of obstacles, optimal algorithms for many variants of median 
and center problems have been known. In I-dimensional case, it is well known 
that median set is a point coinciding with a source point or all points between 
two consecutive source points [5] and the median set can be computed in linear 
time. In the plane, the median set can be computed by reducing the problem to 
a pair of 1-dimensional problems [5] . The center problem in the plane can also 
be solved by reducing it to a pair of 1-dimensional problems which can be solved 
in linear time [5,3]. 

In the presence of obstacles, there have been only few algorithms for me- 
dian and center problems. When all source points are in a simple rectilinear 

* This work is partially supported by KOSEF, Grant No. 98-0102-07-01-3. 
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( 1 ) 



and the center problem is to find a point minimizing 

E(t) = maxu>(s)d(s, f), 

sGS 



(2) 
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polygon P, Chepoi and Dragan [2] showed that a median point can be com- 
puted in 0(n + mlogn) time where m is the number of source points and n 
is the number of vertices of P. Given a set of polygonal obstacles, Larson and 
Sadiq [8] gave an algorithm computing a median point. Recently, Kusakari and 
Nishizeki [7] presented an output-sensitive algorithm that computes weighted 
median set in the presence of pairwise-disjoint, axis-parallel rectangular obsta- 
cles. For a small number of source points, the time complexity of their algorithm 
is 0{{n + k) logn), where n is the number of obstacles and k is the complexity 
of the median set. However, no algorithms for the center problem have been 
known in the presence of pairwise-disjoint, axis-parallel rectangular obstacles. 
In addition to the problems we cited, many variants of the rectilinear median 
and center problems have been studied [6,10]. 

In this paper, we present algorithms computing the weighted median set 
and center set of source points in the presence of pairwise-disjoint, axis-parallel 
rectangular obstacles. For a small number of source points, the time complexity 
of our algorithm is 0(nlogn + k) for median set problem, where n is the number 
of obstacles and k is the complexity of the median set. For center set problem, we 
investigate the worst-case size of a center set and present an algorithm computing 
the center set in O(n^) time when the number of source points is fixed. 



• S2 • S2 • S3 




Fig. 1. (a) Median set of 4 source points with respect to 28 rectangular obstacles, 
(b) Center set of 6 source points with respect to 4 rectangular obstacles. 



2 Preliminaries 

Let X,Y be the coordinate axes of For a point p S let Px,Py denote 
the X- and F-coordinates of p. The Li (or rectilinear) distance between two 
points p and q is jjp - q\\i := \px - q^l + \py - qy\- 
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We consider only rectilinear paths in the plane, simply calling them “paths” . 
Such a path is specified by a sequence of points tt = {po^pi, . . ■ ,pk), where pi 
and Pi+i differ in exactly in one coordinate. The path tt has a length £(7 t) = 
\\Pi ~ Pi-i\\i- If is a coordinate axis, we say a path is (p-monotone if 
the (^-coordinate of points along the path is monotone non-decreasing or non- 
increasing. A monotone path is one that is A-monotone and T-monotone. 

The closure of the set, minus the union of the obstacles in B, is called the 
free space FP(S) of B. Points or paths in FP(S) are said to be free. For two free 
points u,v, a shortest path tt{u,v) between them is a free path that has mini- 
mum path length among all possible free paths connecting u and v. The length of 
7 t(u, v) is denoted by d(u, v) and is called the distance between u and v. Consid- 
ering 7 t(u, v), the sum of lengths of its segments parallel to the A-coordinate axis 
is called the x-distance between u and v, and is denoted by dx{u,v). Similarly, 
the y-distance, dy{u,v), between u and v is defined. 

Pyramids For a coordinate direction (p G {±A, ±A}, we call a path a ^path 
if is a monotone path (poiPii ■ • • iPfe) (possibly pk = oo) where po is a source 
point s G S and the path segment [poiPi] is (('-directed and maximal in the sense 
that Pi is either touches some obstacle or goes to oo (then /c = 1). If it touches 
some rectangle B, then [pi,p 2 ] is orthogonal to the coordinate axis \(p\ and p 2 
is the corner of B. The next segment is again directed as (p and is maximal. 
This pattern is repeated alternatively. It is easily seen that [p 2 i-i:P 2 i] (for all 
z > 1) must consistently be directed along some direction <p' . li p = +X then 
p' is either +Y or —Y. We may call the (-l-A)-path an East-North staircase if 
p' = -l-A, and an East-South staircase if p' = —Y. For each source point s G S, 
the (-l-A)-pyramid of B relative to s is the rectilinear convex region bounded 
by the East-North and the East-South staircases. We similarly define the (— A)- 
pyramid, the (±y)-pyramids. The subset of outside of the four pyramids are 
called the monotone region relative to s. 

The basis for our algorithms is the following simple but crucial observation. 
The proof can be derived from the results by De. Rezende et al. [4]. 

Lemma 1. Eor any two points p,q on a line-segment L C EPiB), and a source 
point s, (i) dy{s,p) = dy{s,q), if L is horizontal; (ii) dx{s,p) = dx{s,q), if L is 
vertical. 

Valleys and Ridges For a free point p = {x,y), let fs,B{x,y) = fs,B(p) = d{s,p) 
(or simply fs{x, y), fs{p)) be a distance function between a source point s and p. 
For a horizontal line-segment L in free space, and a point p = [x,b) € L, 
let fs{x) = fs{x,b). For a vertical free line-segment L, a function ff{y) is 
defined similarly. 

A free point p on a horizontal line-segment L is called a valley point {ridge 
point) of s if fg{x) has a local minimum (maximum) at x = Px, respectively. 
Similarly we define a valley, ridge point on a vertical line-segment. Note that 
there is no valley, ridge point in the monotone region relative to s. A set of 
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valley (ridge) points of s makes a line-segment which is called a valley {ridge) 
of s, respectively. See Figure 2. 

Now we briefly explain how to compute ridges and valleys in the (-I-F")- 
pyramid relative to s. We sweep a horizontal line L in (-|-F")-direction from s 
stopping when L overlaps horizontal edges of obstacles in the (-l-F)-pyramid. We 
keep a set of valley and ridge points on L. A binary search tree is used to maintain 
a sequence of the points from left to right on L. The set of data is updated at 
each stop position of L which is each edge of obstacles. The trajectory of each 
valley (ridge) point on L while L moves in (-l-l^)-direction creates a vertical 
valley (ridge), respectively. As moving L in the +Y direction starting from s, we 
can compute all valleys and ridges in 0(n log n) time. We will omit the details 
in this version. 





(a) (b) 

Fig. 2. (a) Valleys (solid lines) and ridges (dotted lines) within the (-I-V)- 
pyramid relative to s. (b) Ridges relative to si, S 5 overlap a cut C of B. 



Define a cut of a rectangle B as an extension of an edge of B through free 
space in both directions until it first encounters other rectangles. A cut may 
extend to infinity. Similarly we define horizontal and vertical cuts of a source 
point. Assuming the general position of rectangles and source points, each cut 
of a rectangle B is partitioned into three parts, called sub-cuts, by B: Suppose 
that C is a horizontal cut of B. One sub-cut is the same as an edge of B and 
it is called the middle sub-cut of C. The other two sub-cuts lie outside of B 
and they are called the left sub-cut, right sub-cut of C, respectively. In Figure 2 
(b). Cm is the middle sub-cut of a horizontal cut C. Left, right sub-cuts of C 
are represented by Ci,Cr, respectively. 

Considering set of ridges and valleys of all source points in S, each valley of 
a source point is always represented by a union of sub-cuts and some ridges may 
overlap cuts. In Figure 2 (b), a cut C is a ridge of si and CiUCm is a valley of S4. 
Furthermore ridges and valleys of source points may partially overlap a sub-cut 
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even we assume the general position of obstacles and source points. In Figure 2 
(b), sub-cut Cm partially overlap valleys of S2, S3, S4 and ridges of si, S5. 

For a sub-cut C, let Sv{C), Sr{C) be a set of source points that generate 
valleys, ridges partially overlapping C, respectively. C is called a valley, ridge, 
null svh-ciit of S if W(C) = X^sgS„(C) ’^i^)~^sGSr{C) '^(^) is positive, negative, 
zero, respectively. 



3 Computing the Median Set in the Plane 

3.1 The Location of Median Set 

By valleys of all source points in S and edges of obstacles, FP(;B) is partitioned 
into subregions. We call the partition the valley-partition, VP (S'), and call each 
subregion a valley-subregion. For a valley-subregion of VP(S), there may be holes 
caused by a presence of obstacles which are called void obstacles. It is easy to 
see that the absence of void obstacles does not cause any change to the distance 
between a free point and any source point in S. Therefore, in this subsection, 
we assume that there is no void obstacle. Thus each valley-subregion of VP (S') 
is a simple rectilinear polygon. 

For a horizontal line-segment L in free space, points that have the minimum 
sum of distances to all source points among points on L are called candidate 
median points on L. Let L = [p, q] be a maximal horizontal line-segment in 
free space where end-points p and q may lie on the boundary of obstacles or 
lie at 00. For L, let pi,p2,. . . ,Pm be intersections of L with vertical valleys or 
ridges of source points except that pi,Pm are end-points of L. Suppose that 
the points are sorted in ascending order of their ^-coordinates. Consider three 
consecutive points pi-\,pi,pi+i on L. Let I = [pi-i,pi+i\. Then S is partitioned 
into four subsets 81,82, S3, 84 such that each source point s G Si has a distance 
function fg{p) from s to p G I that is monotone decreasing in /. Each source 
point in 82 has a monotone increasing distance function. Source points in S3, 84 
create valley, ridge points at pi, respectively. 

For each interval L = [pi,pi+i], called a base-interval on L, the distance 
function fs'{p) between any source point s G 8 to p G C is a, linear function. 
Thus candidate median points of C are pi, Pi+i or all points in L. 

Suppose that pt is an intersection of L with only ridges of source points in 8. 
Then sum of distance functions in the interval I is strictly concave and pi cannot 
be a candidate median point of I. Therefore on a horizontal line-segment L, 
candidate median points lie on intersections of vertical valleys or all points in 
base-intervals whose end-points are intersections of vertical valleys of source 
points in 8. 

Suppose that all points in two consecutive base-intervals L-i, h are candi- 
date median points on L. Then the sum of distances from a point p G I to all 
source points in 8 is 

D{p)= fsix) + ft{x) 

3^82 sG ' S's 3^84 
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= X! w{s){d{p,-i,s) - (x- (k-i)o;)) + 

sGSi 

^ w{s){d{p,-i,s) + {x- (pi-l)a;)) + 

sGS2 

^ w{s){d{p,,s) + \x- {Pi)x\) + ^ w{s){d{pi,s) - |a; - (p*)a;|).(3) 

SG1S3 sG 54 

The first two terms in the right part of Equation 3 are linear in the domain, and 
the third term is strictly convex and the fourth term is strictly concave. Since all 
points in I are median points, we have the condition D{pi-i) = D(pi) = D(pi^i). 
This condition will be satisfied if w(s) = w(s), i.e., the sub-cut 

intersecting L at pi is a null sub-cut. Thus we have: 

Theorem 1. (i) The median set of source points S consists of (1) vertices of 
valley-subregions of VP{S) which are intersections of vertical and horizontal 
valleys of S; (2) edges of valley-subregions of VP{S) which are subset of valleys 
of S; (3) all points in some valley-subregions of VP{S). (ii) Suppose that all 
points in two adjacent valley-subregions of VP{S) separated by a sub-cut C are 
median points. Then C is a null sub-cut. 

From Theorem 1, at least one intersection of vertical and horizontal valleys 
of source points in S' is a median point. Every valley of source points lies on a 
cut of source points and obstacles. Furthermore the median set of S is a union 
of vertices, edges of valley-subregions of VP(S), or all points in some valley- 
subregions of VP(S). Thus the complexity of VP(S) is at most the complexity 
of the partition of free space by cuts of obstacles and source points. Since the 
number of cuts is 0(n -I- to), the upper bound of the worst-case complexity 
of the median set is 0{{n -\- to)^). For the lower bound, we can construct an 
example whose median size is max(n^, tito, to^) = the construction 

is appeared in the full version. Therefore we have: 

Corollary 1. (i) There exists at least one median point among intersections of 
cuts of source points and obstacles, (ii) The worst-case complexity of the median 
set is 0{{n -\- m)^). 

3.2 Computing One Median Point 

Before computing the median set of source points S', we first compute the opti- 
mum value D* of the objective function (1) in Introduction. By Corollary 1, there 
exists at least one median point among the intersections of cuts of source points 
and obstacles. For each horizontal cut L, we compute an intersection point p, 
among intersections with vertical cuts, that has a minimum sum of distances 
to all source points. By Lemma 1, points on L have the same j/-distances to all 
source points. Therefore the point p is the intersection of L with the vertical 
cut having the minimum x-distances to all source points among the vertical cuts 
intersecting L. 
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For every horizontal cut, we do the following to compute efficiently a ver- 
tical cut having the minimum sum of x-distances. We obtain a decomposition 
P{B, S) of free space into rectangular subregions using horizontal cuts of obsta- 
cles and source points. Each subregion of P is called a horizontal strip. With the 
partition P, there is a dual graph G{P), called the horizontal partition graph 
(HPG) of P: Vertices of G{P) are the horizontal strips of P and two vertices 
are adjacent in G{P) iff the corresponding strips in P are bounded by a com- 
mon sub-cut. Assuming the general position of obstacles and source points, the 
maximum degree of vertices in G{P) is four. 

Edges of G{P) are partitioned into three classes: Edges corresponding to 
cuts of source points are called crossing edges. Pairs of edges corresponding to 
left, right sub-cuts of cuts extending bottom (top) edges of obstacles are called 
merging (splitting) edges, respectively. 

Scheme of the Algorithm Initially we compute the set of horizontal, vertical 
cuts of obstacles and source points. Next for each horizontal (vertical) cut C, 
we compute the sum, Dy(G,S) {{Dx(C, S)), of y-distances (x-distances) to all 
source points in S, respectively. With each obstacle B, we associate a height- 
balanced binary search tree (BST), like red-black tree, T(B) whose leaves are 
designed to store (1) the vertical cuts touching the relative interior of the bottom 
edge of B] (2) Dx(G,S) for each vertical cut C. Each internal node v of T(B) 
stores the cut having the minimum sum of distances among cuts stored in the 
leaves of the subtree rooted at v. 

We traverse HPG from the vertex corresponding to the top strip of P in a 
similar way to a depth first search. The differences are: (1) At each vertex of 
HPG adjacent to a pair of splitting edges, traverse the left edge first. (2) When 
we visit a vertex v through a left merging edge from a vertex u, stop traversing 
at V and back to u. 

While traversing each vertex v of HPG, we construct a BST T(v) whose 
leaves store the vertical cuts that pass through the corresponding horizontal 
strip of in P. Suppose now that we are visiting an edge e of HPG first time 
from a vertex u to a vertex r;, we do the following: Let Ce be the corresponding 
sub-cut of e in P. 

(1) When e is a crossing edge, compute a candidate median point p on Cg. The 
point p is an intersection of Ge and the vertical cut C(T(u)) stored in the root 
of T(u). Then D(p) = Dy(Ge, S) + Dx(C(T(u)), S). After the computation, 
we make T(u) to be the tree T(v) of v. 

(2) When e is a left splitting edge, we split T(u) into three trees Ti,Tm,Tr, 
where Tm keeps the cuts in T(u) that touch the relative interior of top edge 
of B. Ti (Tr) keeps the cuts in T(u) to the left (right) of B, respectively. 
Let Ci,Crm Gr be the left, middle, right sub-cuts of a cut of an obstacle such 
that Gi = Ce. We compute candidate median points on Ci , Cm , Cr from cuts 
in Ti,Tm,Tr, respectively, as in case (1). We leave Tr at vertex u so that it 
will be used when the right splitting edge adjacent to u is traversed. After 
the computation, we make T/ to be the tree, T(v), of v. 
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(3) When e is a right splitting edge, we make the tree left at u while traversing 
the left splitting edge of u to be the tree T{v) of v. 

(4) When e is a left merging edge, suppose Ce is the left sub-cut of an obstacle B. 
We compute a candidate median point on Ce from the cut in the root of T{u) 
and compute a candidate median point on the bottom edge of i? as in case 
(1). After the computation, we merge T{u) with the tree T{B) associated 
with B. 

(5) When e is a right merging edge, do in a symmetric way as case (4). 

Then a median point is the point among candidate points on horizontal cuts 
that has the minimum sum of distances to all source points. 

Theorem 2. Given a set of m source points and a set of n rectangle obstacles, 
a median point can he computed in 0{m(rn + n) logn) time. 

3.3 Computing the Median Set 

In this subsection, we present an algorithm for computing median set of source 
points S. In general, by Theorem 1, a median set is a set of rectilinear polygons, 
called median polygon, that may have holes in each polygon. In a degenerate 
case, polygons reduce to isolated points or line-segments. 

Each median polygon P can be partitioned into a set of vertical strips sep- 
arated by line-segments extending vertical edges of P and edges of holes on P. 
As an output of our algorithm, each median polygon is represented by such a 
set of vertical strips. 

Consider all sub-cuts of obstacles and source points. For a horizontal sub- 
cut C, let C+e, C-e be horizontal line-segments of the same length as C and lying 
above, below C apart from C by e > 0, respectively. Similarly, for a vertical sub- 
cut C, let C+e, C-e be vertical line-segments to the right, left of C , respectively. 
For a horizontal sub-cut C, C is called a plane-above sub-cut if Dy{C,S) = 
Dy{C+e, S'). If C is a plane-above sub-cut then, by the symmetry of Theorem 1 
(i), all points in the subregion R of VP(S) bounded below by C has the same y- 
distances to source points. Similarly we define a plane-below, plane-left, plane- 
right sub-cut. 

For each sub-cut C, we check whether it is a valley sub-cut or ridge sub-cut, 
or null sub-cut. If C is a valley sub-cut, we also check whether it is a plane-above, 
plane-below, plane-left, plane-right sub-cut. Note that a valley sub-cut cannot 
be both plane-above and plane-below, or both plane-left and plane-right sub-cut 
by Theorem 1 (ii). 

The basic structure of our algorithm is the same as the algorithm computing 
one median point. We construct HPG of the free space. We associate a BST 
T{u) with each vertex u of HPG. Leaf nodes of BST store the information on 
the following objects that may constitute median regions: (1) A set of vertical 
line-segments which are subsets of vertical sub-cuts intersecting the horizontal 
strip R{u) corresponding to the vertex u; (2) A set of rectangular subregions 
of R{u) bounded by two consecutive vertical line-segments in (1). If the ob- 
ject is a vertical line-segment C, then the i-coordinate of C and Dx{C,S) are 
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stored in the leaf node. If the object is a rectangle R bounded by vertical line- 
segments Ci,C 2 , then an interval [xi^X 2 ] is stored, where Xi is the s-coordinate 
of Ci, for 1 < i < 2. If Cl is a subset of a plane-right sub-cut or C 2 is a subset of 
a plane-left sub-cut, then all points in the rectangle R could be median points. 
(Thus we store Dx{Ci, S) in the leaf node.) Otherwise, points inside of R cannot 
be median points, therefore the value 00 is stored in the leaf node. Furthermore 
in each leaf node v, the y-coordinate of the horizontal cut that bounds above 
median regions containing the vertical line-segments or rectangles represented 
by V is stored in the leaf node. 

BST has a secondary structure to support searching all leaf nodes that have 
the minimum sum of x-distances. By the secondary structure, we can search the 
leaf nodes in 0{K) time, where K is the number of the searched leaf nodes. Fur- 
thermore we can merge two BST’s, and split a BST into two BST’s in O(logTV) 
time, where N is the size of BST. 

From Theorem 1, median regions are bounded by valley sub-cuts. Thus 
we check whether the corresponding sub-cuts of edges of HPG bound median 
regions while traversing HPG. Leaf nodes of BST have the information on the x- 
coordinates of vertical strips or vertical line-segments which constitute median 
regions. The y-coordinates of edges bounding the vertical strips or vertical line- 
segments will be decided while traversing HPG. Suppose that we have the op- 
timal value D* of Equation 1 which is computed by the algorithm in previous 
section. While traversing each edge e of HPG first time from a vertex w to a 
vertex w, if e is a crossing edge, we do the following: Let Ce be the corresponding 
sub-cut of e in VP (S') and Dx{T{u)) be the minimum sum of x-distances of leaf 
nodes in T{u). If Dx{T{u)) + Dy{Ce, S) > D*, then no median point lies on Ce- 
Thus we do nothing. Otherwise, there are median points on Ce and we do the 
following: Note that Ce cannot be a ridge sub-cut. 

(1) If Ce is a null sub-cut, then Ce does not bound below any median region by 
Theorem 1 (ii). Thus we do nothing for Ce- 

(2) If Ce is a valley sub-cut, then there are median points on Ce- Furthermore 
if Ce is a plane-above or plane-below sub-cut, then Ce bounds median re- 
gions. 

(2.1) If Ce is a plane-below sub-cut, then several valley-subregions of VP(S) 
bounded above by Ce are median regions. Thus we set y-coordinates of 
leaf nodes having minimum sum of a;-distances with the y-coordinate 
ofCe. 

(2.2) If Ce is a plane-above sub-cut, then several valley-subregions bounded 
below by Ce are median regions. Thus a set of vertical strips or vertical 
line-segments are generated as an output. Suppose that the object repre- 
sented by a leaf node r of T{u) is a candidate median region and the leaf 
node has a y-coordinate Xy-lir represents a vertical line-segment which is 
subset of some vertical sub-cut C, then a median region created by r is a 
vertical line-segment [(C)a,, {C)x\ x [(Ce)y, r^]. If r represents a rectangle, 
then a median region created by r is a vertical strip [li, li+i] x [(Ce)y, r^], 
where the interval [xi,Xi+i\ is stored for the rectangle in the leaf node. 
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(2.3) Otherwise, the median set lies on Cg- The outputs are horizontal line- 
segments or isolated points on Cg- 

If e is a splitting or merging edge, then we do the similar work as we have 
done when we traverse a crossing-edge; for details, refer to the full version of 
this paper. 

For binary search trees associated with obstacles and the top strip of VP (S'), 
we merge consecutive leaf nodes into one leaf node if they have the same sum of x- 
distances. Furthermore when two BST’s T\, T 2 are merged, the rightmost interval 
leaf node of T\ and and the leftmost interval leaf node of T 2 will be merged only 
if the intervals do not constitute median set. By merging these consecutive leaf 
nodes, we can reduce the number of vertical strips or line-segments which will be 
output of our algorithm to 0{K), where K is the number of vertices of median 
set. Thus we have: 

Theorem 3. Given a set of m source points and a set of n rectangle obstacles, 
the median set can he computed in 0{m(rn + n) log n + k) time, where k is the 
complexity of the median set of S. 

4 Computing the Center Set in the Plane 

Let S be a set of m source points each with a positive weight , and Bhe & collec- 
tion of n pairwise-disjoint axis-parallel rectangle obstacles. In this section, the 
center set of S with respect to B will be computed. Due to the space limitation, 
we will explain only the outline of the algorithm. 

Before explaining our algorithm, we analyze the worst-case complexity of 
the center set. Its lower bound directly came from Figure 1 (b), and its upper 
bound was proved by using some properties about valleys and ridges of S in the 
presence of B. 

Lemma 2. The worst-case complexity of a center set of S with respect to B is 
0{nm), where \B\ = n and jS”! = m. 

Note that the center set is a collection of line segments (possibly, isolated 
points) each of which has a slope of 45 or 135 degrees with respect to V-axis. 

Our algorithm for computing the weighted center of the source points in the 
presence of the obstacles briefly works as follows: (i) draw cuts of the obstacles, 
partitioning the free space of FP(S) into 0{n^) rectangular regions; (ii) for each 
rectangular region R compute the center set (and its radius) of S with respect 
to B when the center set is restricted to locate in R only; and finally, (iii) collect 
the centers with minimum radius. 

Step (i) Draw cuts of all edges of the obstacles in B. This can be done in 
0(n log n) time by vertical (horizontal) trapezoidal decomposition. This results 
in 0{n^) rectangular regions and intersection points, called grid points. We com- 
pute the distances of the shortest paths between all grid points and all source 
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points. Consider a grid point g and a source point s. Construct the monotone 
region and pyramids (including valleys and ridges) relative to s. While con- 
structing (±F)- (resp., (±X)-) pyramids, compute the x- (resp., y-) distances 
to s of vertical (resp., horizontal) cuts and ridges of s in the pyramids. This needs 
0{mn? + mnlogn) time since there are 0{m) pyramids each taken 0(n log n) 
construction time, and 0{mn?) pairs of grid points and source points each taken 
0(1) time for computing shortest distance between them. 

Step (ii) Given a fixed rectangular region R, we want to compute the center 
set (and its radius d|j) of S in the presence of B when the center set is restricted 
to R. For this, we employ the parametric search [9]. Parametric search is an opti- 
mization technique which can be applied in situations where we seek a minimum 
parameter d* satisfying the monotone condition that is met by all d > d* but not 
by any d < d* . The strategy of the parametric search is to design sequential and 
parallel algorithms for the corresponding decision problem: given a parameter d, 
decide whether d < d* or d > d* . If the sequential algorithm runs in Tg time and 
the parallel algorithm runs in Tp time using Wp processors, the parametric search 
results in a sequential algorithm with running time 0{WpTp + TgTplogWp) for 
computing d* . 

In our problem satisfies the monotone condition, so parametric search 
can be used to compute djj. Parametric search requires sequential and parallel 
algorithms for the decision problem: Given a value d, decide whether there is a 
point c in R such that maxggs c?(c, s) < d (i.e., whether d > d*j^). 

For a source point s, let TZs{d) be the region of FP(S) that is reachable from s 
with distance d or less in the presence of the obstacles. Define Rs{d) = ROTZsid) 
Then, c? > c?^ if and only if Dg^sRsid) yf 0. If there is no obstacle in the plane, 
TZs(d) is an Li-metric disk with radius d, which is an axis-parallel square with 
diagonal length 2d rotated 90 degrees. Since obstacles make paths detour, TZs{d) 
in the presence of the obstacles is a subset of that in the absence of the obstacles. 

Consider a region R and a source point s. Rs{d) is defined as a “wedge” whose 
boundaries are two half-rays with slope of 45 and 135 degrees, respectively. Since 
we have computed the distances between all grid points and all source points, 
we can easily decide the shape of Rs{d) (wedge) for each source point s. Now 
we need to f^Rs{d). For this, we use the fact the boundary of Rs{d) consists of 
only two kinds of half-rays; slopes of 45 and 135 degrees. After all, we can com- 
pute p|i?s(d) in 0{m) sequential-time and in O(logm) parallel-time using 0{m) 
processors. Putting these into parametric search mechanism gives 0(mlog^77i) 
time algorithm for computing the center set (and its radius d^) inside R. 

Step (iii) Since there are 0{n^) rectangular regions, in 0{mn^log^m) time 
we can find d* and collect the centers achieving it. 

Theorem 4. Given a set of m weighted source points and a set of n rectangle 
obstacles, the center set can be computed in 0{mn^log^m). 

For unweighted source points, we do not rely on parametric search but can 
develop a rather simple, direct algorithm. 
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Theorem 5. Given a set of m unweighted source points and a set of n rectangle 
obstacles, the center set can be computed in 0{mn^). 

The complexity of the above algorithms seems to be far from the worst-case 
complexity of the center set, i.e., 0(nm). The main difference between median 
and center problems is that whereas at least one point on valleys should be a 
median point, no points of valleys could be a center point. We believe that such 
a difference makes the center problem harder than the median one. 
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Abstract. We study the efficient discovery of word-association patterns, 
defined by a sequence of strings and a proximity gap, from a collection of 
texts with binary labels. We present an algorithm that finds all d strings 
and k proximity word-association patterns that maximizes agreement 
with the labels. It runs in expected time complexity n) 

and 0{k'^~^n) space with the total length n of texts, if texts are uniformly 
random strings. We also show that the problem to find a best word- 
association pattern with arbitrarily many strings is MAX SNP-hard. 

1 Introduction 

Data mining emerged in early 1990’s [1] aims to devise semi-automatic tools for 
discovering valuable “association rules” from facts stored in large scale databases. 
An association rule is an implication between a presumptive condition and an 
objective condition [6] such that if a record (tuple of attribute values or items) 
in a database satisfies the presumptive condition then with high probability the 
record also satisfies the objective condition. To find all qualified rules, a data 
mining tool receives a positive integer support and a ratio confidence from users, 
and searches for any association rule such that the number of records satisfying 
the rule is more than the support and the ratio of this number to the number of 
records satisfying the presumptive condition is larger than the confidence. 

A considerable amount of results has been known for mainly well-defined 
and structured databases, such as relational databases with boolean or numeric 
attributes (e.g. [1,6]). Beside this, recent progress of measuring and sensing 
technologies and network infrastructure has been rapidly increasing the size 
and the species of weakly-structured databases, such as text databases and 
SGML/HTML archives. These lead to potential demands for data mining tools 
for databases where attributes or data structures may not be assumed in ad- 
vance. However, there are still a few results [3,14] in this research direction. One 
difficulty in realizing them might be that a tool should not only process quickly 
to find rules but also construct attributes or structures defining rules in runtime. 
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Our aim is to develop an efficient tool for text databases along the lines of 
data mining. We introduce the framework of optimal pattern discovery, where 
we concentrate to find presumptive conditions that form association rules with 
a given objective condition achieving the maximum confidence. This captures 
frequently appearing situations in data mining research [6] where the objective 
condition is specified in advance, or so simple. The problem is, given a set of 
texts and its binary labeling function, to find a condition whose classification 
agrees with that of the function for the maximum number of texts. We call this 
maximum agreement problem. The complementary version of this problem, the 
minimum disagreement problem, is tied with the computational learning theory 
in the framework of agnostic PAC-learnability [5,8]. 

As a condition over texts, we consider a word- association pattern 
(pi, . . . ,pd; k) formed from a sequence of strings p\,. . . ,pd and a positive in- 
teger gap k. It matches a text if every string occurs in the text in the given 
order proximate within the specified distance k. This extends traditional queries 
called proximity patterns consisting of two strings and a gap distance between 
them. An algorithm efficiently solving the maximum agreement problem for 
word-association patterns can be applied in a wide range of practical problems, 
such as finding a consensus motif from protein sequences in the computational 
biology researches. 

By a naive algorithm, the maximum agreement problem of word-association 
patterns formed from at most d strings is solvable in time with the 

total length n of given texts. However, the practical importance of the problem 
requests more efficient algorithms. Nakanishi et al. [12] showed that if d = 1 and 
no disagreement is allowed then a pattern can be found in linear time. Arimura 
et al. [3] devised the algorithm that finds a word-association pattern whose 
agreement is maximum in 0{m ■ n^) time for d = 2 where m is the number 
of texts. In this paper, we present an algorithm that for every d > 1 finds 
a d-words fc-proximity word-association pattern with the maximum agreement 
in expected running time 0{k'^' ^nlog^^'*’^ n) and ^ n) space, if texts are 

uniformly random strings. Even in the worst case the algorithm runs in time 
logn). These suggest that, for the inputs almost being random strings 
such as the biological sequences, the algorithm runs drastically faster. 

On the other hand, if the number of strings in word-association patterns is 
not limited, the maximum agreement problem is intractable. Dealing with this 
hardness, we should look for a polynomial-time algorithm that solves the problem 
approximately with some guaranteed approximation ratio. We partially clarify 
this issue by presenting the nonapproximability of our maximization problem. 
We show that the maximum agreement problem of word-association patterns 
formed from arbitrary many words is hard to approximate within a factor arbi- 
trarily close to one in polynomial time, i.e., has no polynomial-time approxima- 
tion scheme (PTAS) unless P=NP. It is an interesting contrast that the problem 
is approximable in polynomial-time within a fixed approximation ratio. 

The remainder of this paper is organized as follows. First, we introduce some 
notions and definitions. Next, we present the efficient algorithm that discovers 



Maximizing Agreement with a Classification 



41 



all the best word-association patterns, and analyze its running time. Then we 
prove the nonapproximability of the problem. We end with some discussion on 
the relation between our problem and computational learning theory. 



2 Notions and Definitions 

The set if is a finite alphabet throughout this paper. We always assume some 
fixed total ordering on the letters in S. We refer by e to the empty string in S* 
whose length is zero. For a string s S S*, we denote by s[i] with 1 < i < |s| 
the ith letter of s and by |s| the length of s. The concatenation of two strings s 
and t in S* is denoted by s ■ t, or simply by st. If for a string t there exist 
(possibly empty) strings u,v,w G E* such that t = uvw then we say that u, v 
and w are a prefix, a subword (a substring) and a suffix of t, respectively. Let t 
be a substring of s G E*. An occurrence of t in s is a positive integer label 
1 < j < |s| that specifies a position and a region where in s the substring t 
occurs, that is, an index i such that s[i] • • • s[i -I- |t| — 1] = t. 

Let oo be the symbol of the positive infinity. A proximity word-association 
pattern tt over E {word-association pattern) is a pair tt = (pi, . . . ,pd ;k) of a 
finite sequence of strings in E* and a parameter k called proximity which is either 
a nonnegative integer or oo. A proximity word-association pattern tt matches a 
string s G if* if there exists a sequence ii, ... ,id of integers such that every pj 
in 7T occurs in s at the position ij and 0 < ij+i — ij < k for all 1 < j < L The 
notion {d,k) -pattern refers to a d- words fc-proximity word-association pattern 
(pi, . . . ,pd; k). If the proximity k is oo, then we simply write tt = (pi, . . . ,pd) 
by omitting k. The concatenation of two patterns tt = {a; k) and r = (/3; k) is 
defined by ttt = (a/3; k). 

A sample is a finite set S = {si, . . . , Sm} of strings in E* and an objective 
condition over S' is a binary labeling function ^ : S ^ {0, 1}. Each string Si 
in S is a document and f{si) is a label. Let S be a sample and ^ an objective 
condition associated with S. Then, for a string s G S, we say that a word- 
association pattern tt agrees with ^ on s if tt matches s if and only if ^(s) = 1. 
The notion |S| refers to the number of elements in S. For nonnegative integers i,j 
with i < j, we denote by [i..j] the interval {i,i-\-l,. . ■ ,j}. We define [i..j] to be 
empty if j < i. A d-dimensional axis-parallel rectangle or d-dimensional box is 
a region [ii..ji] x • • • x [id-.jd] with integers ii,ji, . . .,idjd- 

Definition 1. Max Agreement by c?-Words /c-Proximity Association 
An instance is a tuple {E, S,^,d,k) of an arbitrary large alphabet E, a set 
S C E* of documents, an objective condition ^ : S {0, 1} and positive in- 
tegers d and k. A solution is a d-words k-proximity pattern tt over E, and the 
measure ofn is the number of documents in S on which tt agrees with f. The goal 
of the problem is to find a word-association pattern tt maximizing the measure. 

We refer to the enhanced problem where the parameters d and k are not re- 
stricted by simply Max Agreement by Word Association. 



42 



Hiroki Arimura and Shinichi Shimozono 



3 Computing the Maximum Agreement by Association 
of a Bounded Number of Words 

In this section, we describe an efficient algorithm that solves the maximum agree- 
ment problem for (d, /c)-proximity patterns in expected time 0{k^' ^nlog'^’*’^ n) 
and space 0{k‘^~^n) when (1) S' is a set of uniformly random strings with total 
length n over a finite alphabet, and (2) d and k are constants. 

Before looking inside of the algorithm, let us briefly review a key data struc- 
ture, a sujfix tree [10]. Let f be a text of length n, and for 1 < p < n let tp be 
the suffix of t starting at the position p. The suffix tree for t is a rooted tree 
Treat that satisfies the following: (i) Each edge is labeled by a subword w of t, 
with which any other edge from the same node must be distinguished by the 
first letter of their labels; (iii) Each node v represents a subword Word{v) of t 
obtained by concatenating the labels on the path from the root to v in its order. 

Let s be a subword of a string t. The locus of s in Treet, denoted by Locus(s), 
is the unique node v of Treat such that s is a prefix of Word(v) and W ord{w) 
is a proper prefix of s, where w is the parent of v. The subword w on an edge of 
Treat is referred by the occurrence of w in t. 

It is clear that Treat has exactly n leaves and at most n—1 internal nodes, and 
can be represented in 0{n) space. McCreight [10] gives an elegant algorithm that 
computes Treet in linear time and space. The height of Treet is the maximum 
number of nodes on a path from the root to a leaf. The expected height of suffix 
trees is known to be very small for a random string [4]. 

By assuming all edges from the same node are lexicographically ordered, 
the ith leaf U for 1 < i < n in some appropriate traversal represents the suffix 
of rank i in the lexicographic order. Let Ap^ , Ap^ , ■ ■ ■ , ^p„ be the sequence of all 
the suffices of A in lexicographic order. Then, we store the indices Pi,P 2 , ■ ■ ■ ,Pn 
in an array suf of length n in this order, and define pos to be the inverse array 
of suf of length n, that is, pos{suf{x)) = x for all index 1 < a; < n. These arrays 
are called the suffix arrays [9]. 

Let {E, S, if, d, k) be an instance of the maximum agreement problem. Obvi- 
ously, the maximization of the agreement equals the maximization of the measure 
A = count i— county, where counta = ]{sG S' | “tt matches s” and f (s) = a } | 

for each a G {0,1}. In Fig. 1, we show the algorithm. 

Let $ ^ 27 be a special delimiter symbol such that $ yf $. Given a sample 
S = jsi, . . . , Smj, the algorithm builds a single string A=si$---$Sm$of length 
n = |A| and the suffix tree TreCA for all strings padded in A in linear time. For 
1 < p < n, we define 6{p) = iiip belongs to the fth document Si £ S. A pattern tt 
is in canonical form if it can be expressed as tt = (W ord{ui ) . . . , Word{ud)', k) 
for a sequence of nodes ui, ... ,Ud of Tree a. Here, if Uj is a leaf then we define 
Word{uj) to be the longest prefix of the subword represented by uj that does 
not contain $. If Uj is an internal node then we can see that Word{uj) does not 
contain $. This is because $ cannot match any symbol including itself, and thus, 
every node corresponding to $ must be a lowest branching node. Let Tg £ E* be 
an arbitrary string whose length is maxj |s| | s G S'} -I- 1. Clearly, zis_j(Ts) = 0. 
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Algorithm. BoundedJA aximum_Agreement-. 

Input: An alphabet E, an integer A: > 0, a sample S C E* and an objective 
condition ^ : S — > {0, 1}. 

Output: A (d, fc)-proximity pattern tt over E that maximizes Z\5^5(7r). 

1 Let A = si$---$Sm$ be the input text associated with S. Compute the 
suffix tree TrecA for A, the suffix arrays suf and pos = suf~^. 

2 Let Diagd,k,s be the d-dimensional diagonal set of width k. Then, com- 
pute Rankd,k,s = { (xd, ■ ■ ■ , xi; S) | Xj = pos(ij) for every 1 < j < 
d and (id, . . . , ii; <5) G Diagd,k,s } by transforming the labeled points in 
the position space Diagd,k,s into those in the rank space Rankd,k,s using 
the suffix array pos. 

3 Call Discovered, Rankd.k.s , pd) with the empty pattern pd := {',k). 

4 Return the best patterns tt found if the maximum of Zi is positive. Other- 
wise, return J_s. 



Fig. 1. The algorithm for computing the maximum agreement over (d, k)- 
proximity patterns 



Lemma 1. is maximized over all (d,k) -proximity patterns by either a 

pattern n in canonical form or J_ 5 . 

This can be shown by the discussion similar to that for d = 2 in [3] . 

By Lemma 1, it is sufficient to consider only the patterns in canonical form. 
The idea is that to search and test such patterns, we reduce the search of d- 
words patterns to that of d-dimensional boxes over the space formed by the 
lexicographically ordered suffices. This allows an average case efficient algorithm 
with random inputs by visiting only a small fraction 0{h‘^N) of patterns from 
0(n‘^) possible combinations of d substrings of A, where h is the height of the 
suffix tree for the input text and N = k‘^~^n is the number of points in Diagd,k,s- 

The d-dimensional diagonal set of width k, denoted by Diapd k s — 

[1..to], is the set of all the labeled d-dimensional points (fi, . . . , id] S) such that 
for ig = 0,id+i = n and some 0 < S < m, every ij with 1 < j < d satisfies 
0 < ij+i—ij < k and 6{ij) = 6. Let Rankd,k,s C [l..n]‘^x[l..m] be the set of all the 
labeled d-dimensional points {xd, . . ■ ,xi]S) such that {id, ■ . . ,ii]S) € Diagd,k,s 
and Xj = pos{ij) for every 1 < j < d. We call Diagd,k,s and Rankd,k,s the 
position and the rank spaces. 

We can see that for any node v of the suffix tree Tree a, the set of the 
occurrences of Word{v) form a consecutive subinterval I{v) = [X 1 ..X 2 ] of the 
rank space [l..n]. Then, we relate each (d, fc)-word association pattern tt = 
{Word{vi), . . . ,Word{vd)] k) in canonical form with the d-dimensional box 
Box{n) = I{vi)x ■■■xl{vd) over Rankd,k,s- For any set Q C [l..n]^ x 
we define the measure of box B{tt) by Aq^^{B{tt)) = count'i — county, where 
count'^ is the number of the indices 5 S such that {x\, . . . ,Xd]d) S 

Q n {Box{tt) X [1..to]) and ^{6) = a for each a G {0, 1}. The following lemma is 
essentially due to Manber and Baeza-Yates [9]. 
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Procedure. Discovered, Q, pd)- 

1. The case that d > 1: 

(a) Compute the set Acoord of the first coordinates of the points in Q and 
sort it. Let Lf := 0 and JTnew := 0 - 

(b) For each coordinate x G Acoord do: 

Attach the set Qx,, = { [xd-i, . . . ,xr,5) \ {xd, Xd-i, . . . ,xi\S) G Q } 
to the leaf \x of rank x, and then append Xx to the right end of H. 

(c) For each level I — 0,1, . . . ,h do: 

(i) Scanning H from left to right, for each v G H do: 

Let w be the parent of v. Then, update Q™ by Q™ := Qw U Qx 
in sorted order in the (d — l)st coordinate (in h if d = 1). If v is 
the leftmost child of w then append w to the right end of ddnew 
If f > 0, then recursively call Discover (d — I, Q^, Pd-i) with 
Pd-i ■■= pd-{Word{v)-,k). 

(ii) Discard Q„ for all nodes v G H. Let H ddnew and Idnew := 0 - 

2. The case that d = 0: 

Compute counta ■= | {d £ <3 I C(d) = q} I for each a G {0, 1}. If A := 
counti — counto is larger than the largest value seen so far, record the 
pattern po. 



Fig. 2. The procedure Discover 



Lemma 2. Let S be a sample and Q = Rankd,k,s be the set defined above. 
Then, = Aq^^{Box{tt)) for any {d,k)-pattern tt in canonical form. 

Proof. Assume that a (d, fc)-pattern tt = [pi, . . . ,pd',k) matches some docu- 
ment ss € S. By definition, we can see that [ii,...,id) € is the vector of 

the occurrences of the components pi, . . . of tt if and only if {ii, . . . , id] d) S 
Diagd,k,s and for every 1 < j < d, index ij is an occurrence of pj in A. Since ij 
is an occurrence of pj if and only if pos{ij) G I{vj) where Vj = Locus{pj), 
this is equivalent to that (pos{ii), . . . ,pos{id)]S) G I{vi) x • • • xl{vd) x = 
Box{tt) X provided that (ii , . . . ,id]S) G Diagd,k,s- Thus, we know that tt 

matches G S' if and only if Rankd,k,sPBox{T:)x\l..m] contains a point labeled 
by the document name 5. □ 

Fig. 2 shows the subprocedure Discover that recursively computes best boxes 
Z\g^^(i?oa;(7r)) by making a series of orthogonal range queries directly over the 
suffix tree with dynamic programming. 

Theorem 1. Let E be an alphabet, d and k are fixed integers, S be a sample, 
and ^ be an objective function for S . Then our algorithm computes all the (d, k)- 
patterns in canonical form that maximizes the measure over {d, k) -patterns in 
time 0{k‘^~^ h‘^n log n) and space 0{k^~^n) in the worst case, where n is the 
total length of strings in S and h < n is the height of the suffix tree for A. 

Proof. Given a sample S, the algorithm computes A, TreCA, Diagdks and 
Rankd,k,s- Qd = Ro,nkd,k,s- By Lemma 2, the maximum agreement problem 
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reduces to finding a sequence (ui, . . . ,Ud) of nodes of TrecA such that the d- 
dimensional box B = I{vi) x • • • x I{vd) maximizes 

Let Q be any subspace of c-dimensional rank space for c < d. In what follows, 
we write {xc, . . . ,xi) for a c-dimensional point in Q and we denote by Q\p{xc,...,xi) 
the subspace of Q consisting of all points {xc, ■ ■ ■ ,xi) G Q that satisfies the 
condition P{xc, ■ . . ,x\). 

Initially, Discover is invoked with (d, (; fc)). When Discover is invoked 

with (c, Q, p) for some c < d, the procedure computes the subset Qy using 
dynamic programming by visiting each node v from the leaves to the root. Then 
by induction on the height of v, Qy is exactly the subset of Q consisting of the 
points with the c-th coordinate Xc is in /(u), i.e., Qy = Q\xcGI{v)- 

Let Qc be any c-dimensional rank space and By = Qx ■■■ x Ii he any box 
over Qc, and let Qc-i and By-i = Ic-i x • • • x/i be their projections to (c — 1)- 
dimensional rank space. Then, if Xy G I{v) for all (xy, . . . ,xi) G Qy then the 
following equivalence holds: Aq^^^(Bc) = Aq^_,^^^{Bc-i). Thus, we can reduce 
the problem to that in the lower dimension in this way. 

From these observations, it is not hard to see that whenever Diseover is in- 
voked with (c,Qy,pc) for pc = {vd, ■ ■ ■ ,Vy+i), Qy equals Qd\xjeiiva),...,xa+iei{v,+i)- 
Thus, when the computation reaches c = 0, Qo becomes the set of the dis- 
tinct labels in Qd n {I{vd) x ■ ■ ■xl{vi) x [l..m]), and Step 2 correctly computes 
AQd,^{I(vd)x . . ,xl(vi)). Hence, now we see that Discover computes the maxi- 
mum of As,^ by visiting all boxes that has nonempty intersection with Rankd,k,s- 
Next, we consider the time complexity of Discover. Let N = k‘^~^n be the 
cardinality of Diapd k s and, thus, of Rankd,k,s- At Step 1 of the main algorithm, 
we compute TreeA and pos,suf in 0{n) time using the linear time algorithm 
of [10]. Step 2 is computable in 0{N). We assume each Qy is implemented by 
an appropriate data structure that allows insertions in O(logn) time. At Step 3, 
the computation time Td{N) of Discovered, Qd, Pd) with N = \Qd\ is estimated 
as follows. If d = 0 then we can easily see Tq{N) = 0{N). Suppose that d > 1. 
Let Hi be the set of nodes in level I, and for each v G Hi, let N{v) = |Q„|. Then, 
we have the recurrence 



Td{N) = 0{N) + E E N(v) logN{v) + E E Td-iiN{v)) 

l<l<hvGHi l<l<hvGHi 

with the constraint J^yeHi A^(^) = N for all 1. The first term is for Step (a) 
and Step (b), the second term is for merging Qy’s and the third term is for 
computing Discover{d— l,Qy, pd-i). Solving this recurrence, we have Td{N) = 
O {dh^N log N). This completes the proof. □ 

Corollary 1. Let S be a set of uniformly random strings with total length n over 
a finite alphabet. Then the problem Maximum Agreement by d-WORDS k- 
Proximity Association is solvable in expected time 0{dk'^~^nlog‘^~^^ n) and 
space 0{k‘^~^n), where {S,S,^,d,k) is an instance of the problem, and n is the 
total length of strings in S . 

Proof. Let A be a uniformly random string of b letters of length n over A U {$}. 
Let Hn be the random variable that gives the height of the suffix tree for A of 
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length n. For every h > 0, Devroye et al. [4] gives the bound P{Hn < h) < 
/ {1 — q) + nq^^) on the probability that Hn < h, where q = (1/6)^/^. 
Then, the average running time is E(T(n)) = ’ P{Hn = h), where 

the addition is taken over all A of length n and T{n,h) = dk'^~^h^n\ogn). By 
calculating the addition separately for the cases Hn < 2 logj n and Hn > 2 logj n, 
we have the expected running time E(r(n)) = 0(d/c‘^“^nlog'^~'’^ n). □ 

4 Hardness of Approximating Maximum Agreement by 
Association of Arbitrary Many Words 

As we have seen, Max Agreement by c?-Words fc-PROxiMixY Association 
for fixed d and k is solvable in polynomial time. However, the problem has seemed 
to be computationally intractable if d and k are not limited. This observation 
comes from the NP-completeness of a consistency problem of a class of regular 
patterns [11] in computational learning theory. Although the proof of this result 
does not capture the hardness of the optimization, it tells us that the maximum 
agreement problem of word-association patterns is intractable if the optimum is 
very close to the total number of texts. 

Consider the following trivial algorithm: Given an instance, simply count 
the numbers of positive-labeled strings and negative-labeled strings and choose 
the better one from either an empty pattern accepting all labeled strings or a 
pattern rejecting all strings (for example a string longer than all given strings). 
This algorithm clearly approximates the problem with a guaranteed factor 1/2, 
and thus Max Agreement by Word Association is in class APX [2]. 

On the other hand, it is hard to approximate the problem within an arbi- 
trarily small error in polynomial time. The main result in this section is the 
following theorem. 

Theorem 2. Max Agreement by Word Association is Max SlAP-hard. 

Proof. We prove this by an L-reduction [13] from Max 2-Sat. We build for a 
Max 2-Sat instance (j> = (A, C) of a set X = {x\, . . . , Xn} of variables and a 
set C = {ci,...,Cm} of 2-literal clauses, an instance tt = of Max 

Agreement by Word Association. 

We firstly define an alphabet Ai = {oi, 6^} for every 1 < i < n -|- 1 where n 
is the number of boolean variables |A| in </>. Let bin{i) for 1 < i < n be the 
binary representation of i in by some natural coding. We denote 

by symbols ti,fi,Ui,di the strings aiaibi,biaiai,aibiai,biaibi in {AiY for all 
1 < i < n, respectively. Note that any word-association pattern that accepts 
both dj and cannot reject either or f^, or both. Let Si for 1 < i < n be a 
set either {t^, f u^, d^}, {ti,Ui,di} or {fi,Ui,di}. Then the notion [S'i] refers a 
pattern that accepts strings in Si and rejects {t^, f u^, d^} — Si. For example, 
[{ti,di,Ui}] is a pattern {aihi) or (ai,bi), which accepts ti,di,u but rejects f^. 

Next, for each clause Ci € C, we associate the following negative-labeled 
string: For each j from 1 to n, we concatenate either (i) d^ if Xj and Xj are not 
in Ci, (ii) ij if the literal Xj appears in c,, or (iii) tj if xj is in c^. Then to its 
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end we append bin{i). For example, we associate with a clause Ci = {xj, Xk) with 
1 < j < fc < n the string di • • • f j • • • d„ • bin(i). Note that we are assuming 

no clause in C has two complementary literals. Such a clause can be handled as 
the auxiliary additional value to the measure in the reduction. 

For each clause c^, we associate two positive-labeled strings di • • • d„ • bin{i) 
and ui • • • u„ • bin{i). We refer to these by down-string and up-string, respectively. 
The labeled sample corresponding to cj) is defined as the set of the negative- 
labeled strings and the positive-labeled strings associated with all clause in C. 

Now we specify the translation from a solution p for {A^, T^, to a Boolean 
assignment Bp for </>. Let Si for 1 < i < n be either {ti,Ui,di} or {fi,Ui,dj}. 
Then, a boolean assignment-pattern p for {A^, T^j,, is a word-association pat- 
tern [^i] • [S' 2 ] [S'n] • where denotes a pattern 

that accepts all strings in By regarding {ti,Ui,di} and {fi,Ui,di} as 

‘true’ and ‘false’ assignment to Xi respectively, an assignment pattern rejects a 
negative-labeled string if and only if the corresponding clause is satisfied. This 
fact is important since the following lemma holds: 

Lemma 3. For any solution p o/tt = {A^,T,p,f^), we can compute in logspace 
an assignment-pattern p' for tt whose measure is larger than that of p. 

We show a brief outline of a proof of this lemma. Without loss of generality, we 
can suppose that a solution p accepts both some up-strings and down-strings and 
achieves the measure larger than 2/3-\T^\. Otherwise we simply take the empty- 
pattern ( ) as p. Then, such a pattern p can be decomposed into the (possibly 
empty) subsequences pi formed from only letters in Ai for all 1 < i < n -I- 1, 
since p must accept all the positive-labeled strings except those rejected by 

If we modify the last segment [p„+i] to accept more positive-labeled strings, 
at most one negative-labeled string will be accepted for each pair of newly ac- 
cepted up-string and down-string ending with bin{i). Thus we can modify Pn+i 
to pattern that allows to accept all positive-labeled strings. Now let pi be one 
of the subsequences of p accepting both and f i. We replace every such subse- 

quence Pi with either [{di,Ui,ti}] or [{d^, u^, f fy]. Each choice can be done even 
arbitrarily, and every such replacement at least retains the number of rejected 
negative-labeled strings. Thus we obtain an assignment-pattern p' from p. 

The Lemma 3 tells that the measure of the optimum solution of tt is given 
by some assignment pattern, and a boolean assignment-pattern p corresponds 
to the boolean assignment, say Bp, to X with respect to the formula 4>. Then, 
it is immediate that the following claim hold: 

Claim. For every 2-CNF formula </>, there holds opt^n) < 5 • opt{(f). For every 
assignment-pattern p, there holds opt{f>) — m{<f>,Bp) = opt{Tr) — m{'K,p). 

The first statement is true because, for any 2-CNF formula <j), always opt{(j>) > 
\\C\ [7] and opt{n) = 2 • |C| -I- opt{(f) hold. The second statement is clear since 
with an assignment-pattern all the positive-labeled strings are accepted and 
thus only the number of rejected negative-strings corresponding to the satisfied 
clauses makes a difference between opt{n) and m{n,p). □ 
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Corollary 2. There is no PTAS for Max Agreement for Word Associa- 
tion Patterns unless P = NP. 

5 Discussion 

Agnostic PAC-learning [8] is a generalization of a well known PAC-learning 
model in computational learning theory, where the learner has no information 
about the structure behind the target function, or training examples may contain 
arbitrary large percent of noise. Kearns et al. [8] gives a characterization that for 
any hypothesis class of low VC- dimension, the polynomial time solvability and 
the efficient agnostic PAC-learnability for the class are equivalent. Although 
the problem is intractable in most cases, recently some geometrical patterns, 
e.g. axis-parallel rectangles and convex k-gons on Euclid plain, are shown to 
be efficiently agnostic PAC-learnable by this characterization [-5]. Since word- 
association patterns obviously have polynomial VC-dimension, we know from 
Theorem 1 that our algorithm in Section 3 works as an efficient agnostic learning 
algorithm for d-words fc-proximity word-association patterns. To our knowledge, 
this is one of the first results on the efficient agnostic learnability for nontrivial 
concept classes other than geometric patterns. 
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Abstract. In this paper, we study issues on disjunctions of propositional 
Horn theories. In particular, we consider deciding whether a disjunction 
of Horn theories is Horn, and, if not, computing a Horn core, i.e., a 
maximal Horn theory included in this disjunction. The problems are 
investigated for different representations of Horn theories, namely for 
Horn CNFs and characteristic models. While the problems are shown 
to be intractable in general, we present polynomial time algorithms for 
bounded disjunctions in the formula-based case. 



1 Introduction 

Since deduction from a set of propositional clauses is a well-known co-NP- 
complete problem, different approximation methods for reasoning from a clausal 
theory have been investigated, e.g., [10,13,3,4,1]. One of these approaches [10] 
uses a Horn greatest lower bound (also called Horn core [11]), i.e., a maximal 
Horn theory ifc C if, if we view theories as sets of models, and the least Horn 
upper bound (Horn envelope [11]), which is the minimal Horn theory D S, 
i.e., ife C E' for any Horn theory E' such that E' D E. Computing Horn en- 
velopes and Horn cores has been investigated in [10,11,2,4,8,6,1]. It has been 
shown that a Horn core of a given CNF formula ip is computable in polynomial 
time with an oracle for NP [2] , and that computing a maximum (in terms of the 
numbers of models) Horn core is co-NP-hard [11], if the theory E is given by 
the set of its models. 

In this paper, we consider the issue of computing Horn cores of the dis- 
junction E = Ei of Horn theories Ei, represented either by Horn CNFs, 
or by their characteristic models [9]. Characteristic models have been proposed 
as a model-based alternative to formula-based theory representation. The two 
approaches are orthogonal with respect to space requirements, i.e., the one ap- 
proach sometimes allows for an exponentially smaller representation than the 
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other; see [9,12]. Observe that a disjunction S of Horn theories is in general not 
Horn. Hence, it is of interest whether S is Horn, in particular, since then the 
Horn core and the Horn envelope coincide to S. 



Disjunctions of Horn theories may be encountered in different applications. 
For example, suppose that two groups have respectively formed logical hypothe- 
ses about an application domain (the “world”), e.g., relationships between med- 
ical tests and diseases, and they believe the relationships amount to a Horn 
theory. The hypotheses Si and S 2 of two groups (amounting to sets of mod- 
els) are obtained from actual and conjectured cases, respectively, i.e., concrete 
measurement data (obtained by experiments) and data which are believed to be 
true. Suppose the hypotheses Si and S 2 are merged. Then, at the logic level, 
the disjunction, i.e., union S = Si U S 2 describes the merged hypotheses. It 
is of particular interest to know whether S is Horn; if so, then the hypotheses 
are compatible in the sense that no further cases have to be adopted in order to 
preserve the Horn property, which may indicate that the individual hypotheses 
are sound. However, if S is not Horn, then either further cases have to be added 
to maintain the Horn property (which corresponds to derivation of new case 
knowledge), or some of the adopted hypothetical cases have to be abandoned. 
Applying Occam’s razor, it is natural to add or to abandon a minimal set of 
cases. In the former case, this amounts to finding the Horn envelope of A, and 
in the latter case, to finding a Horn core U of S. In finding a Horn core 77, it 
may be also asked to add a constraint such that 77 includes all actual cases Si, 
i.e., computing a Horn core 77 satisfying C 77 C 17 can be seen as one of the 
interesting and important problems. 



In this paper, we first address whether a disjunction S = Si of Horn 
theories Si, represented either by Horn CNFs or by their characteristic models, 
becomes a Horn theory. We show that checking this property is co-NP-complete 
for both representations in general, but is polynomially solvable if I is bounded 
by a constant. We next deal with the problem of computing a Horn core 77 of 77 
satisfying Si C II C S, which has arisen in the above example. We show that, 
if I is bounded by some constant, then the problem is polynomially solvable from 
Horn CNFs, while it is co-NP-hard from characteristic models, even if Z = 2. For 
the formula-based representation, we present a polynomial time algorithm CORE 
to compute a Horn core of the disjunction of two Horn theories 77i and 772. We 
also develop an algorithm CORE* for computing a Horn core of the disjunction 
of ^ (> 3) Horn theories. This algorithm is polynomial if I is bounded by some 
constant. Furthermore, we give structural characterizations of Horn cores of a 
disjunction of Horn theories. As for the Horn envelope (which is always unique), 
we show that it is polynomially computable from characteristic models, but not 
from Horn CNFs, even if / = 2. 



Due to space constraints, most proofs are omitted. They are given in [7]. 
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2 Preliminaries 

We assume a supply of propositional variables (atoms) x\,X 2 , ■ ■ ■ ,Xn, where 
each Xi evaluates to either 1 (true) or 0 (false). Negated variables are denoted 
by Xi - These Xi and Xi are called literals. A clause is a disjunction c = iiV ■ ■ - W ik 
of literals, while a term is a conjunction A • • • A£fc of literals. By P{c) and N{c) 
(resp., P{t) and N{t)) we denote the sets of variables occurring positively and 
negatively in c (resp., t); _L (resp., T) is the empty clause (resp., empty term), 
i.e, falsity (resp., truth). A conjunction of clauses (p = f\^Ci (resp., a disjunction 
of terms = \J^ti ) is called conjunctive normal form (CNF) (resp., disjunctive 
normal form (DNF)). 

A model is a vector v S {0, 1}", whose i-th component is denoted by A 
theory is any set S C {0, 1}" of models. We denote hy v < w the usual bitwise 
ordering of models, where 0 < 1. A model n € A is minimal in A, if there is no 
w G S such that w < v. For B C {1,2, .. . ,n}, we denote by x^ the model v 
such that Vi = 1, if i G B and Vi = 0, if i ^ B. 

For any formula (p, let T{(p) = {n € {0, 1}" | (p{v) = 1} be its set of models. 
A theory E is represented by (p, if T(ip) = E. If unambiguous, we often do 
not distinguish a formula from the theory it represents. For any ip and tp, we 
write (p < Ip if T{p) C T{ip). A nontautological clause c (resp., noncontradictory 
term t) is an implicate (resp., implicant) of a theory E if c(y) = 1 for all v G E, 
(resp., t{v) = 0 for all v ^ A); it is prime, if no proper subclause (resp., subterm) 
is an implicate (resp., implicant) of E. 

A theory E is Horn if E = Cl/,,{E), where Cl/,,{S) is the closure of S' C (0, 1}” 
under bitwise AND (i.e., intersection) of models v,w, denoted by v f\w. Any 
Horn E has the least (unique minimal) model, given by where 

for S C {0, 1}" is the intersection of all models in S. A clause c is Horn if 
|-P(c)| < 1, and a CNF is Horn if all its clauses are Horn. As well-known, a 
theory E is Horn iff it is represented by some Horn CNF. Moreover, all prime 
implicates of a Horn E are Horn. 

A Horn theory E^ is a Horn core of a theory E [10,11], if C A and no 
Horn theory A' exists such that Ec d E' C_ A. Observe that, in general, A 
has more than one Horn core; e.g., A = {(110), (101)} has two Horn cores 
Ep = {(110)1 and Ep = {(101)}, respectively. The Horn envelope of A [10,11] is 
the Horn theory Ag D A such that no Horn theory A' such that Ag D A' D A 
exists. For the above A, Ag = {(110), (101), (100)}. As easily seen, the Horn 
envelope is always unique. Let p he & formula representing a theory A, and 
let Pc and pe be formulas representing a Horn core and the Horn envelope of A, 
respectively. Then pc and pc are also called a Horn core and the Horn envelope 
of p, respectively. 



3 Formula-Based Representations 

Our first result is that deciding the Horn property for a disjunction of Horn 
CNFs is intractable in general. 
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Theorem 1. Given Horn CNFs (pi, (p2, ■ ■ ■ ,Pi, deciding if <p = Vi=i Horn 
is co-lSlP -complete. □ 

The reduction in the hardness part of the proof of this theorem uses a number 
of Horn theories which depends on the instance of the reduction. For the case 
where I is bounded by some constant k, we have a positive result. We need 
further notation first. 

Let (pi = Cij, i = 1 , 2 ,. .. , 1 . Denote by HC{p\,p2, ■ ■ ■ ,Pi) the set of 

all nontautological Horn clauses c such that N{c) = Ui=i and P(c) = 
P{ci,ij) for some i, where ij S {1, 2, . . . , rui] selects one clause for each i. 

Example 1 . Let p\ = (xi V X3 V X4){x2 V T5 V Xq) and p2 = (i^i V X2 V a;3)(xi V 
X3 V Xq). Then HC{pi,p2) ={(^1 V ^3 V X4), (xi Vxs V a;6), (^1 V^2 V^s V a;6), (^1 V 
X2 Va ;3 V^s), (^1 V *3 V ®5 V *6)}- LI 

The following lemma states that any Horn theory represented by a dis- 
junction of Horn CNFs has a CNF formula consisting of clauses only from 

HC{pi,p2,---,Pi)- 

Lemma 1 . Let pi,p2, ■ ■ ■ iPi be Horn CNFs, and let p = \l\^iPi. Then p 
represents a Horn theory if and only if p = Aces some subset S C 

HC{pi,p 2 , pi). 

Proof. The if-part is obvious because all clauses c S HC{p\,p2, pi) are 

Horn. For the only-if-part, let p represent a Horn E . Observe that p is equivalent 
to the CNF 



A = Mh A™li • • • AriiAi.u V V ■ ■ ■ V cz, J. ( 3 . 1 ) 

Since p' also represents a Horn theory E and all prime implicates of a Horn 
theory are Horn, for each nontautological clause c' = (ci_q V C2,i2 V • • • V ) 
in p' , there exists a Horn prime implicate c ol E satisfying p' < c < c' . Note 
that HC{pi, p2, . . . .pi) contains all maximal Horn clauses c* with c* < d . Some 
clause c* in HC{p\,p2, . ■ . ,pi) satisfies p' {= p) < c < c* < d. Thus, 
replacing each d in p' by such a c* again produces E, which implies the only-if- 
part. □ 

Theorem 2 . For I bounded by a constant k, the problem in Theorem 1 can be 
solved in polynomial time. 

Proof. (Sketch) By Lemma 1 , for each d in p' of ( 3 . 1 ), we only need to find 
a clause c* in HC{p\,p2, . . ■ ,pi) such that p < c* < d . li each d has such 
a c*, we can conclude that p is Horn; otherwise, we can conclude that p is not 
Horn. As for the time complexity, since < c* is equivalent to the condition 
that Pi < c* holds for all i, and since each pi is Horn, checking (^ < c* is 
polynomial. Furthermore, we have at most a polynomial number of d and c*. 
Thus, the overall time is polynomial. □ 
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Let us now consider computing a Horn core. The following proposition, to- 
gether with Theorem 1, implies that this is a difficult problem in general as 
well. 

Proposition 1. A theory S has a unique Horn core if and only if S is Horn. 

□ 

Corollary 1. Given Horn CNFs <p 2 , ■ ■ • , computing a Horn core of ip = 
Vi=i Fi co-NP -hard. □ 

This is a rather negative result. The proof does not apply for a small (bounded 
by a constant) number of Horn theories, though. We next show that for two Horn 
theories {I = 2), the problem is polynomial. 

3.1 Horn Cores of the Disjunction of Two Horn Theories 

We start with the following lemma showing that any Horn core of a disjunction 
of Horn CNFs can be represented by a Horn CNF consisting of only clauses from 
HC{pi,p 2 , ■ ■ • , Pi)- By Proposition 1, this can be seen as a generalization of 
the only-if-part of Lemma 1. It follows from a similar result for CNF theories. 

Lemma 2 (cf. [10]). Let pi, p 2 , ... T Pi be Horn CNFs, and let ip he any Horn 
core of p = \/\^iPi. Then ip = Aces'" holds for some subset S C 

HC{pi,p2,....pi). 

Proof. (Sketch) As p is equivalent to the CNF p' of (3.1) and ip is a, Horn core 
of p, each nontautological clause c' in p' is subsumed by some prime Horn im- 
plicate c of '0 (i-e., c' > c). Since HC{p\,p2, ... ,pi) is the set of all the maximal 
Horn clauses subsuming at least one clause in p' , some c* G HC{p\, p2 , . . ■ , pi) 
exists such that c < c* < c'. The CNF ip' obtained from p' by replacing each c' 
by such a c* satisfies ip < ip' < p' {= p); this implies ip = ip' . □ 

Note that the converse is not true in general, even if ip is the CNF ob- 
tained from p' as in (3.1) by replacing each nontautological clause c' in p' by a 
clause c* G HC{p\,p 2 , . . ■ ,pi) with c* < d . The following example gives such 
an instance. 

Example 2. Let p\ and p 2 be as in Example 1. Then p' of (3.1) is written as 
p' = {x\ V V a;4 V a;6)(xi V T2 V X3 V T5 V xq){xi V X2 V X3 V T5 V xq) (where 
we exclude the tautological clauses from p') . Let us consider a Horn CNF 

Ip = (Ti Vx3 V Xa){xi VT2 V 2:3 V xd){xi V T2 V X3 V T5 V xq). 

Note that all clauses in ip are from HC{p\,p 2 ) (see Example 1), and that ip 
is obtained from p' in the desired way, as {x\ V T3 V X4) < {x\ V T3 V X4 V a;6), 
{x\ V X2 V X3 V T5) < {x\ V X2 V X3 V *5 V xq) and {x\ V T2 V X3 V T5 V a;6) < 
(xi V a;2 V S3 V X5 V le). However, ip is not a Horn core oi p = pi\/ p2, since 

Ip' = {x\ Vs3 V Xi){xi Vs2 Vs5 V S6)(si Vs2 VS3 VS5 V Se) (3.2) 
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satisfies -0 < - 0 ' < (= tp), which shows that 0 is not Horn core of ip. In fact, 

0 < 0' follows from ip < (xi\/x2Vx5 \/ Xq) , and this combined with 0(110111) = 0 
and 0 '( 11 O 111 ) = 1 implies ip < ip' . As shown later, ip' of ( 3 . 2 ) is a Horn core 
of ip. □ 

Now we give an algorithm which computes a Horn core of the disjunction of 
two Horn theories. 

Algorithm CORE 

Input: Horn CNFs ipi = ci,i and ip 2 = Aj^i C2.j. 

Output: A Horn core ip oi p = ip\\/ ip 2 . 

Step 1. S := {c*j = ci,i U C 2 ,j \ c* j ^ T, i = 1, 2, . . . ,mi, j = 1, 2, . . . , m 2 }; 
52:={c€S| |P(c)| = 2}. 

Let cpj (resp., cfj) be the Horn clause c such that 
N{c) = N{c*j) and P(c) = P(ci,i) (resp., P(c) = P{c 2 ,j)). 

Step 2. Sa ■.= S \ S 2 ; Sb ~ S 2 ; 

for each c*j £ S 2 do 

if pi < Cij and p 2 < cjj then 

begin Sa := Sa U {cA}; Sb := Sb \ {c*j} end 

elseif pi < cpj and p 2 < di,j then 

begin Sa := Sa U {c^}; Sb := Sb \ {c*j} end; 

Step 3. Output Ip := AceSa ^ Ac* ^.eSt Ci.j- ° 

Example 3. Let us apply algorithm CORE to Horn CNFs p\ and p2 given in 
Example 1 . In Step 1 , we have S = {(TiW 3 Va: 4 Va; 6 ), (Ti VT2Va;3VT5Va;6), (Ti V 
X2 V X3 V X5 V le)} and S2 = {(Ti V X3 V 2:4 V le), (Ti V X2 V 2:3 V T5 V 2:6)}. The 
clause C2 1 = (Ti V T 2 V 2:3 V X5 V 2:5) satisfies the if-statement (i.e., pi < and 
< c\ i holds for c\ i = (xi V T2 V X5 V 2;e)); the other clause in S2 satisfies 
neither the if- nor the elseif-statement. Thus, Step 3 outputs the Horn CNF 
of ( 3 . 2 ). □ 

Observe that HC{p\,p2) is the set of all clauses c\ j and and that this 
algorithm runs in polynomial time. Indeed, both implication tests in the “if” of 
Step 2 are solvable in linear time since p\ and p2 are Horn (cf. [ 5 ]), and all other 
steps are clearly polynomial. 

Theorem 3 . Let p\ and p2 be Horn CNFs. Then, algorithm CORE computes 
a Horn core ip of p = piW p2 in polynomial time. Moreover, ip includes p\, i.e.. 
Pi < Ip holds. 

Proof. (Sketch) Observe that 



Pi < tp < Pi^ P 2 ( 3 . 3 ) 

obviously holds. Indeed, each clause c in Sa of Step 3 is an implicate of pi, and 
each clause cN is subsumed by some clause of pi; hence, the first implication 
holds. The second holds since piV p2 = Ac* .gS Isolds by definition, and each 
clause c= c* j in (^1 V (^2 is subsumed by some clause in ip. 
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It can be shown that '0 is one of the maximal Horn CNFs '0* such that 
< 0* < V ipi holds, i.e., it is a Horn core of V □ 

We mention that the algorithms for computing a Horn core in [10,2,1] are 
not polynomial on the disjunction of two Horn CNF formulas; they require a 
CNF for input. 

An analysis of the algorithm CORE reveals that it bears no nondeterminism 
in computing a Horn core 0 satisfying < 0 < V(/? 2 - This raises the suspicion 

that a Horn core including ip\ might be unique. This is in fact the case. 

Proposition 2. Let and E2 he Horn theories, and let E = Si\J E2- Then, 
there exists a unique Horn core H that satisfies Ei C H E . □ 

A a consequence, we call the two Horn cores 77i and 772 satisfying Ei C 
7Ji C- El U E2 and E2 C 7 T 2 TJi U E2, respectively, the canonical Horn cores 
of Hi and 772 with respect to E = Ei U E2- Observe that the generalization of 
Proposition 2 to a disjunction of k{> 3) Horn theories does not hold. We note 
the following structural result. 

Proposition 3. Let E\, E2, ■ ■ ■ , Ei be Horn theories. Then A = con- 

tained in every Horn core of E=\J\^,E,. □ 

3.2 Horn Cores of the Disjunction of more than Two Horn Theories 

Let us first show that the direct generalization of algorithm CORE does not 
produce a Horn core for the case Z > 3. 

Example f. Let ipo = {x\ V T2 V xa){x2 V xr){xz V 2:5 V T7), ipi = (^3 V 2:5), and 
P 2 = (^1 Vxs V Xe), and let tp = V V (^ 2 - Then p is equivalent to the CNF 

ip' = (xi VX2 VX3 VX4VX5VX6) (xi VX2 Vx3 VxsVxeVx?) (xi Vxs VxsVxg V xt). ( 3 . 4 ) 

Since no c* in HC(ipo,Tij T 2 ) is an implicate of ip' (= p) (note that, for I = 2, 
HC{pi,p 2 ) is the set of all clauses cjj and cfj), Sb in Step 3 of the generalized 
algorithm CORE, applied to po,pi,p2, is the set of all clauses appearing in p' 
of (3.4). Thus, for the respective S'a = 0 (= {c G S' | |P(c)| < 2}), we obtain 

0 = {x\ V X 2 V 2:3 V Xa){xi VX 2 VX 3 V X 7 )(xi VX 3 V X5 V X7). 

However, this 0 is not a Horn core of p, since 

Ip' = {x\ V *2 VX 3 V X3){xi V T 3 V X 5 V X7) 

satisfies fj < ip' < p' {= p). As shown in Example 5, ip' is in fact a Horn core 
of p. □ 

By using CORE repeatedly, we can give an algorithm to compute a Horn 
core of p. This algorithm is polynomial if I is bounded by a constant k. In- 
formally, it constructs a sequence of (not necessarily strictly) increasing Horn 
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theories ipo < iIji < tjj 2 ^ ■ which is contained in ip. The sequence converges 

to tjj*, which is a Horn core of ip. 

Let ipo,ipi, ■ ■ .,<pi be Horn CNFs, and let CORE(^i,^ 2 ) be a Horn CNF of 
the canonical Horn core of fii with respect to p,i V ^ 2 - Define a sequence ipi, 
z > 0, as follows: 

'00 = 

ipi = CORE(0o,<<3i); 

102 = CORE(0i,(^2); 

ipi = CORE(0j_i, (/?;); 

= CORE(0i.i+j_i,(/jj), z > 0, l<j<l 

This sequence tpi monotonically increases ipo to a Horn core of ip = V!=o ’ 
by first increasing it in ipi, then in ip 2 and so on. However, note that is not 
necessarily a Horn core of ip. The reason is that in building tpi, say, some models 
of ipi may have been excluded which now can be added to the models of 4ii such 
that the Horn property is preserved. To catch them, the algorithm loops and 
reconsiders ipi, ip 2 etc. until no further change is possible. 

Let us denote by 



tj}* = lim tpi 

i — K30 



the limit of the sequence. Observe that tp* = ipk holds for some finite fc > 0. 



Lemma 3. For any Horn CNFs ipo,ipi, . . . ,ipi, the above tp* is a Horn core of 

ip = ipoM ipi\/ . . .\/ pi. 



Proof. (Sketch) By an inductive argument, it is easy to show that every ipi is 
a Horn CNFs satisfying ipi < p. Hence, ip* is a Horn CNF satisfying ip* < p. 
Assuming that ip* is not a Horn core of p, we derive a contradiction. 

In this case, as shown in [7], some model v G E\H* exists such that iT*U{z;} 
is Horn, where H* and E are the theories represented by ip* and p, respectively. 
This V satisfies v G Ei for some i, where Ei is the Horn theory represented by pi. 
Let k be the index such that ipk = ip*. Then, since ip* is the limit, ipk> = ip* 
holds for all k' > k. Consider the least k' > k such that ipk' = CORE('0fe/_i, pi), 
i.e., the second argument is pi. Since ipk’-i = ip* 3’Hd ipk'-i{v) = 0 hold, ip* is 
not a Horn core of ip* W pi, a contradiction. □ 

Based on this lemma, we design the following algorithm CORE*. 
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Algorithm CORE* 

Input: Horn CNFs ipi = A^i Cio'i for i = 0, 1, . . . , h 
Output: A Horn core of (/p = (po V V • • • V (/p;. 

Step 1. Set ij) := tpo', i := 0; changes := 0; 

Step 2. while changes < I do begin 

if i < / then i ■.= i + 1 else i := 1; 

'^ne'w CORE(?/:, 
i/jne-w then 

begin S' := {c G HC{(fio, (pi, . . . , V5i) I < c}; 
i/’ := Aces c; changes := 0; 

end 

else changes := changes + 1; 
end{ while}; 

Step 3. output a Horn CNF i/i. □ 

Note that, during the iteration of Step 2 , the size of a Horn CNF 
il^new might become large. Thus the algorithm replaces tpnew by 
ij;' — AceHC(ipo VI vi)-<l>new<c^- Lemma 2 , this does not affect correctness. 
The algorithm repeatedly calls CORE with ip as the first argument where the 
second argument loops within tpi, (p2, • ■ . , It halts, if no change within I subse- 
quent calls is detected, i.e., CORE(i/i, = CORE(' 0 , (pi+i) = ... = 
CORE(' 0 , (fi-i) holds, where z > 1 and z — 1 = Hf z = 1 . 

Example 5 . Let us apply the algorithm CORE* to ip = cpo V (fi V (p2, 
where ipo, ip\ and ip2 are as in Example 4 . In the first iteration of Step 2 , 
using algorithm CORE from Subsection 3 . 1 , we obtain 

'Ipnew = CORE((po,</?i) = (xi VX2 Vxs V a;5)(x2 Vxs V a;5)(T3 V X5 Vxy). 

We have 7 LC((po, <^2) = { (Ti VT2VT3Va:4)(Ti VT2Vx3Va:5), (xi Vx2 VT3 Va;6), 
(Ti V X2 V T3 V 2:7), {x\ V T3 V xs V T7), {x\ V X3 V le V x-j) }; hence, 

zAi = (Ti V X2 VT3 V xsllTi VT3 V 2:5 V X7). 



In the second iteration, we have ipnew = CORE(z/;i, (^ 2 ) = ipi, and hence 
ip2 = ipi holds. Since I (= 2) consecutive members ipi and ip2 satisfy ipi = ip2, 
the algorithm outputs a Horn CNF ip 2 and halts. □ 

Theorem 4. Algorithm CORE* outputs a Horn core ip 0/ (p = (po V V • • • V (p/ 
satisfying ipo < ip < ip. Moreover, it is polynomial if I is bounded by some 
constant. □ 



4 Characteristic Models 

For any Horn theory S, a model z; € if is called characteristic [9], if 
V ^ Cl/x{E \ {z;}). The set of all characteristic models of E, the characteris- 
tic set of E, is denoted by C*{E). Note that every Horn theory E has the 
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unique characteristic set E.g., consider the Horn theory E = {(0101), 

(1001), (1000), (0001), (0000)}. Then C*{E) = {(0101), (1001), (1000)}. In the 
rest of this section, we reconsider the problems in the previous section, assuming 
that Horn theories are represented by their characteristic models. We give here 
a summary of results and refer to [7] for further details. 

Like in the case of CNFs, deciding whether a disjunction of Horn theories is 
Horn is intractable in general. 

Theorems. Given the characteristic sets C*{E{), C*{E 2 ), ■■■, C*{Ei) of 
Horn El, E^, ■ ■ ■ , Ei, deciding whether E = Ei is Horn is co-NP -complete. 

□ 

The next result gives a precise semantical characterization of the Horn prop- 
erty of a disjunction. 

Theorem 6. Let Mi C {0, 1}", i = 1, 2, . . . , L Then E = Ui=i is Horn 

iff AdgS^ ^ ^ holds for every S C Mi such that 1 < |S'| < L □ 

As an immediate consequence, we obtain the following result. 

Theorem 7. For I bounded by a constant k, the problem in Theorem 5 is poly- 
nomial. □ 

Theorems 5 and 7, respectively, coincide with Theorems 1 and 2, in which 
input Horn theories are represented by Horn CNFs. 

For the case where Horn theories are represented by a CNF, we have presented 
in the previous section a polynomial-time algorithm for computing a Horn core ip 
satisfying ipi < "p < (fii V (p 2 - For characteristic models, a similar algorithm is 
hard to be found. This is suggested by the following theorem, which tells that 
recognizing a Horn core is intractable, even for the case of two Horn theories. 

Theorem 8. Given the characteristic sets C*{Ei) and C*{E 2 ) of Horn theories 
El, E 2 C {0, 1}", deciding whether Ei is a Horn core of E = EiU E 2 is co-NP- 
complete. □ 

We remark that computing the Horn envelope of a disjunction ^ = ul ,1 E,, is 
polynomial under characteristic model representation, while it requires in general 
exponential time (and space) if Horn CNFs are used. Indeed, the characteristic 
set of E is given by C*{M), where M = lJi=i C*{Ei). Clearly, C*{M) is com- 
putable from M in polynomial time, by repeatedly removing from M a model v 
which is represented by the intersection of other models in M , until no longer 
possible. 

On the other hand, any Horn CNF ip representing E may be exponential in 
the size of Horn CNFs ipi,ip 2 , . . . ,pi representing Ei,E 2 ,. . ■ , Ei, even if Z = 2. 

E.g., let ipi = Xq and ip 2 = {xiVx 2 . ■ EXm)L./\"fi{xjVyj). Then ip = ipiVip 2 
is equivalent to the CNF (iq VTi VT 2 . . . VTm) A Ajli(®o V Xj yfij). The Horn 
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envelope of ip is represented by the conjunction of all Horn prime implicates 
of ip, i.e., 



^ /\z2G{x2,V2} Vj = l ■^i)’ 

As easily seen, no clause in ijj is redundant. Hence, tp is the shortest Horn CNF 
for the Horn envelope of p, and its size is exponential in the sizes of p\ and p 2 - 

5 Conclusion 

In this paper, we considered the Horn cores and the Horn envelope of a disjunc- 
tion S = [j^ Si of Horn theories Si. We have proven that checking whether S is 
Horn is in general co-NP-complete for both representations, but is polynomially 
solvable if I is bounded by a constant. We have also shown that, if I is bounded 
by some constant, a Horn core II of S satisfying Si Q II Q S can be computed 
from Horn CNFs in polynomial time, while it is co-NP-hard from characteristic 
models, even if Z = 2. As for the Horn envelope, we have shown that it can 
be computed from characteristic models in polynomial time, but it cannot be 
computed from Horn CNFs, even if Z = 2. 

One of the further issues remaining for research is the problem of computing 
a Horn core II of S (which might not satisfy Si C II C S) from characteristic 
sets, in the case in which Z is bounded by some constant. 
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Abstract. We formalize and investigate a model for zero-knowledge 
proofs of “program result-correctness”, which naturally extends Blum’s 
theory of program checking by adding zero-knowledge requirements. The 
zero-knowledge requirements are universal for yes and no instances alike. 



1 Introduction 

Suppose a company has developed an efficient heuristic or an excellent algo- 
rithm to solve a problem, for which no efficient algorithm has yet been found 
(examples of such problems are “quadratic residuosity” (QR) [20] and “graph 
isomorphism” (GI) [26]). To sell or license the program, the company has to an- 
nounce the availability of the new algorithm and its salesmen must demonstrate 
its correctness effectively. A serious problem for the salesman is that the demon- 
stration should be carefully crafted: It should not reveal information to tricky 
customers who may seek getting important algorithmic details. Furthermore, 
while demonstrating on these tricky clients’ instances, the actual algorithm’s 
output should not be revealed (for free). We note that (1) the salesman need not 
know the algorithm itself (it should suffice that he knows how to demonstrate 
the correctness of the program by interacting with it), and (2) of course, during 
the demonstration he needs to use only the program and not, say, an additional 
communication to a super-computer. 



1.1 Our Results and Relationships to Known Results 

A starting approach to this problem may be to try to employ the theory of pro- 
gram checking introduced by Blum and Kannan in [12]. It has strong connections 

* Partially done while visiting Columbia University. 
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with interactive proof systems of membership, introduced by Goldwasser Micali 
and Rackoff [22]. In fact, [12] concretely proposed a method for checking the 
correctness of a program deciding whether two given graphs are isomorphic or 
not by using an interactive proof system for membership of GI and one of its 
complement GNI of [21]. However, the salesman cannot use directly this proto- 
col since it reveals the bit indicating isomorphic/non-isomorphic and eventually 
(via self reducibility) the isomorphism between the two input graphs as well. 
A second approach to the salesman’s problem might be to try to directly use 
the theory of zero-knowledge interactive proof systems of membership. E.g., [22] 
proposed a protocol to show that a given integer is a quadratic residue (QR), 
and a different protocol to show that a given integer is a quadratic non-residue 
(QNR). Both protocols are zero knowledge and do not give additional informa- 
tion. If we try the combined protocol where if the integer is a quadratic residue, 
then the prover executes the first protocol and otherwise, the second. Unfortu- 
nately, this protocol still leaks this one-bit information. Also, it seems that we 
need to formulate the fact that the program really “knows” the output (rather 
than merely a proof of membership). Thus more care is needed. 

The study of zero-knowledge proofs of knowledge which do not even reveal 
the one-bit information of whether a particular statement, is true or false was 
initiated by Feige, Fiat, and Shamir [16]; however, their model deals with a 
prover who has a special NP-knowledge. Recently Di Grescenzo, Sakurai, and 
Yung [14] gave new zero-knowledge protocol techniques and showed a perfect 
zero-knowledge protocol for proving the result-correctness of a program for 
a random self-reducible problem. A zero-knowledge proof of program result- 
correctness is an interactive protocol between an efficient prover and an efficient 
verifier. The goal is to transfer the correctness of the output returned by a pro- 
gram on a given input x and proving it correct without revealing any information. 
This model of [14] is a proper generalization of that of [16] (with no restriction 
to NP n co-NP), motivated us to consider the possibility of zero-knowledge re- 
quirements in program checking [12]. In the context of [12] (as well as in the 
model of [8] which is a proof of membership motivated by program checking), 
limiting the power of the prover to the power of the program checked is impor- 
tant. In this paper we explore which programs can be demonstrated correct in 
a zero-knowledge fashion. Next we show why the models in [16,14] fall short of 
capturing this last constraint. 

Perfect Zero-knowledge and subtlety with previous protocol tech- 
niques 

We first consider methods to demonstrate result-correctness of program in per- 
fect zero-knowledge, i.e., without revealing any information to infinitely-powerful 
adversaries. We observe that the protocol in [16] implies that a polynomial-time 
prover can show that he can decide whether a given input belongs to the lan- 
guage of quadratic residuosity modulo Blum integers or not. However, the prover 
requires the knowledge of the complete factorization of the modulus, which is 
believed to require greater computational power than only to decide quadratic 
residuosity. Thus, the discussion above fails to meet our original goal, in which 
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the prover requires only the minimal power of a correct program to solve QR 
and the prover is otherwise limited to efficient computation (thus, may not be 
able to factor numbers). A similar problem arises when we apply the general 
protocol in [14] for random self-reducible languages to the case of QR problem. 
Thus, no previous model and general technique induces the possibility of proving 
in perfect zero-knowledge the result-correctness of a program to solve QR. By 
designing a new protocol, this paper affirmatively answers to this problem for 
QR under the promise that the modulus is a Blum integer. We also notice that 
the protocol for GI in [14] works within the model formulated here as well. 
Upper bounding the class of problems with perfect zk-protocols 
We next give a theoretical upper bound to the class of problems having per- 
fect/statistical zero-knowledge proofs of result-correctness, which suggests that 
such stronger security limits the power of these demonstration systems. A tool 
used by our proof is the upper bound to the power of the prover in statistical 
zero-knowledge proofs of membership given in [11], and our argument makes a 
new use of the (knowledge-)extractor. 

The computational-zk case 

Finally, we consider methods to demonstrate result-correctness of program in 
computational zero- knowledge, i.e., without revealing any information to a 
polynomially-bounded adversary. Feige et al. [16] showed, under the assump- 
tion that secure encryption schemes exist, the prover can show that it possesses 
a witness to a statement in NP n co-NP without telling anything new about 
it (not even whether the prover found a proof or a counterexample). We ex- 
tend this result to PSPACE-complete decision problems under our model, and 
show that assuming one-way functions, the correctness of the program to solve 
PSPACE-complete decision problems can be demonstrated in computational 
zero-knowledge . 

1.2 Related Work 

The notion of “hiding information of inputs from an oracle” of Abadi, Feigen- 
baum and Kilian [1] considers the issue of the requester hiding its input from the 
program (or computational device), and its zero-knowledge aspect is investigated 
by Beaver, Feigenbaum and Shoup in [.5]. Our model can be viewed as treating 
the opposite direction: how to protect the (results of) the program/computing- 
device. 

Frankel, Gemmell, and Yung [17] introduced the notion of cryptographic 
program checking. They considered the security needs of the program owners 
as our work, thus we next discuss the difference between [17] and ours. In [17] 
model, after the interaction between the owner and the checker, the checker 
knows the (correctness) result y = P{a) of the program P of the owner for a 
given input a. Namely, the [17] ’s requirement of security against the checker is 
leaking “no additional knowledge” as in [19,15] where the result is given to 
the verifier. Whereas our requirement of security is stronger: the checker cannot 
learn anything, even the result of the program P for a given input a as the 
security requirement in [16,14]. 
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The approach developed in [17] is the “witness-based” approach. Namely, 
to use a (small) trusted collection of instances of inputs and their corrected 
outputs that the checker holds in advance. The idea of zero-knowledge checking 
by viewing the result for a randomly selected set of witnesses was given in Yung’s 
“proof of computational power” [30] (which is another motivating model); but 
there the verifier should be able to sample these instances. In contrast, our 
model is more general: the checker assumes to hold no witnesses (just black box 
access to the output of a decision-problem solver, and no witness sampling) . We 
remark that the “witness-based” approach [17] is working well for the problem 
with algebraic properties (such as random self-reducibility) [1]. In particular, it 
is useful for the application of robust homomorphic function sharing [13]. On the 
other hand, our model can be applied to the problem without such homomorphic 
properties (e.g. Graph Isomorphism). From complexity-theoretical point of view, 
the model in [17] can be regarded as a special case of the previous interactive 
proofs [22,19,12]. Thus, the known complexity results on the previous [22,19] 
can be applicable to model of [17]. However, our model is completely new, so no 
previous complexity theoretic result can directly apply. For example, showing 
a theoretical upper bound on the class proved in our model requires a new 
argument (e.g. for Theorem 3). 

2 The Proofs of Result-Correctness: Model and 
Complexity 

2.1 Notations 

If A and B are two interactive probabilistic Turing machine, by pair (A,B) we 
denote an interactive protocol ([22]). Let x be an input common to A and B. By 
ti'(A,B)(a^) we denote the transcript of an execution of protocol (A,B) on input x, 
that is, the messages written on V’s communication tape during an execution 
of protocol (A,B) on input x. By OUTB(tr(A,B)(a;)) we denote B’s output given 
the transcript tr(A,B)(2:)- 

If L is a language, by xl ■ {0, 1}* ^ {0, 1} we denote the characteristic function 
for the language L (i.e., xl{x) = 1 if and only if x € L). 

2.2 Program Checking and Competitive Interactive Proofs 

Before defining our new model, we recall the notions of program checking and 
competitive interactive proof systems. Program checkers was introduced in [12]. 



Definition 1 ([12]). Let C be a PPT oracle Turing machine. We say that C a 
checker for a function F : {0, 1}* ^ {0, 1}* if for all program tt and all x € {0, 1}* 
it is the case that 

1. If n{y) = F{y) for all y G {0,1}*, then Prob(OUTc(tr(,,_C)(®)) = 
ACCEPT) > l-2-l’’^l. 
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2. If 7t(x) ^ F(x), then Prob(OUTc(tr(^,C) (a;)) = ACCEPT) < 

Note that the probabilities are over the random coin tosses of the checker C. 

The following type of interactive proof systems was considered in [8] . 

Definition 2 ([8]). Let A and B be two interactive probabilistic polynomial- 
time (PPT) Turing machines. We say that (A,B) is a competitive interactive 
proof for membership of a language L if the following two conditions hold. 

1. For all xeL, Prob(OUTs(tr(AL,B)(a;)) = ACCEPT) > 1 - 2-l"^l. 

2. For all x ^ L, and all functions A*, Prob(OUTB(tr(A*,B)(a:)) = ACCEPT) < 

2“l“l. 

2.3 Definition and Properties of Proof Systems of 
Result-Correctness 

Informally, a proof system of program result-correctness is an interactive proof 
system between a PPT prover having access to the program to be proved correct 
and a PTT verifier. The prover’s goal is to transfer the correctness of the output 
of the program on a given input x through this interactive protocol. 

The formal definition for proofs of program result-correctness has two require- 
ments: competitive regularity and validity, which we first informally describe. 
The regularity requirement states that the verifier accepts with high probability 
for any input x. The competitive regularity further requires that the PPT prover 
can make the verifier accept by just using the program as an oracle [8]. The va- 
lidity requirement states that there exists an extractor that, for any input x, and 
interacting with any prover that forces the verifier to accept with ‘sufficiently 
high’ probability, is able to compute the correct output of the program (i.e., 
to compute the decision predicate that the program is supposed to compute), 
within a ‘properly bounded’ expected time. We remark that this differs from 
previous work on proofs of knowledge in which the extractor existed only for 
input in the language and was required to output a string satisfying a polyno- 
mial relation with the input. Our approach allows to consider even languages 
beyond NP. The basic approach in defining the validity requirement follows the 
definition for proofs of knowledge given in [7]. 

Definition 3. Let P : {0, 1}* ^ {0, 1} be a decision problem, err : {0, 1}* ^ 
[Ql] be a function, and let ttp be a program supposed to compute P. Also, let A 
and B be two interactive PPT Turing, where A has access to program irp. We 
say that (A,B) is a proof system of program result-correctness with knowledge 
error err for P if: 

1. (Competitive Regularity) 

If for all X, TTp{x) = P{x) then Prob(OUTp(tr(A,B)(a:)) = ACCEPT) > 

1 - 2“l"’l. 

2. (Validity) There exists a probabilistic oracle machine E such that for all x 

and any Turing machine A', and letting =Prob(OUTB(tr(-A',B)(2^)) = 

ACCEPT), if accA'{,x) > err{x) then. 
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- Prob(OUTi^(tr(A^E)(^)) = P{x)) > 1 - 

1 3? I 

— The machine E halts within expected time bounded by r’-y ttt i 

^ ■' (accj^/ (x)—err(x)} ’ 

for some constant c > 0. 

2.4 Complexity of Proof-Systems of Result- Correctness 

If the decision problem P has a proof system of program result-correctness, 
we will also say that the language L has a proof system of program result- 
correctness, where L is the set of strings x such that P{x) = 1. Clearly, any 
competitive interactive proof of membership for a language L is a proof of pro- 
gram result-correctness for the language L. In particular, from [29], we obtain 
that there is a proof system of program result-correctness for PSPACE. We ob- 
serve that the above definition can be easily generalized to search problems. 
Here, using the result of [27] one obtains that any #P-complete function has 
a proof system of program correctness. Furthermore, we can give the following 
upper bound for the model of proofs of program result-correctness. 

Proposition 1 If there exists a result- correctness proof system (with knowledge 
error 0) for a program tt deciding a language L, then L is in PSPACE. 

To show that this upper bound is true, one can just observe that the honest 
prover can be clearly implemented in PSPACE and the extractor runs in ex- 
pected polynomial-time when interacts with the honest prover; then just run an 
algorithm in _ psp^CE that simulates the interaction between 

the prover and the extractor. 

Babai, Fortnow, and Lund [4] proved that every EXP-complete language has a 
function-restricted interactive proof system (namely, a proof system where the 
honest prover decides the language and the dishonest prover is a function from 
the set of instances to { yes, no }). This, using the checker characterization the- 
orem of [12], implies that every EXP-complete language has a program checker. 
Thus, we conclude that: 

Proposition 2 Not all languages having a program checker have a proof system 
of result-correctness (with knowledge error 0) unless EXP = PSPACE. 

2.5 Security against Verifiers 

We now add security concerns to the newly introduced proof-systems. The zero- 
knowledge requirement for these proof systems of program result-correctness 
is defined similarly as for interactive proof systems of membership. We first 
recall that the view of the verifier is everything he reads from his tapes during 
an interaction with the prover: formally we define Viewy {x), V’s view of the 
interaction with P on input x, as the probability space that assigns to pairs 
(i?; trp v(R)(a:)) the probability that R is the tape of V’s random bits and that 
trp v(R)(^c) is the transcript of a conversation between P and V on input x given 
that R is V’s random tape. 
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We notice that in the case of proofs of program result-correctness for a lan- 
guage L, the zero-knowledge condition requires that the simulator exists both if 
X £ L and \i x ^ L (i.e. for any input x € {0, 1}* ), whereas a zero-knowledge 
proof for membership for a language L requires the existence of the simulator 
only when x € L. 

Definition 4. Let (A,B) be an interactive proof system of program result- 
correctness with knowledge error err for P. We say that (A,B) is a perfect 
zero-knowledge proof system of program result- correctness with knowledge er- 
ror err for P if for each B' there exists a probabilistic machine 5 b' running in 
expected polynomial time such that for all x the probability spaces ViewBi {x) 
and 5B'(a;) are equal. 

Intuitively, the definition above says that the verifier cannot get any information 
after its interaction with the prover, not even the value of P{x). 

We note that in our previously developed protocol for GI [14], the prover is 
competitive (since Gl-search reduces to Gl-decision), which implies: 

Theorem 1. Graph Isomorphism has a perfect zero-knowledge proof of pro- 
gram result-correctness (with knowledge error 0). 

We also consider more restricted security according to the level (namely, statis- 
tical and computational) of indistinguishability. 

Definition 5. Let (A,B) be an interactive proof system of program result- 
correctness with knowledge error err for P. We say that (A,B) is a statistical 
zero-knowledge proof system of program result- correctness with with knowledge 
error err for P if for each B' there exists a probabilistic machine 5 b' (called the 
simulator) running in expected polynomial time such that for all x, the proba- 
bility spaces ViewB'{x) and 5B'(a:) are statistically indistinguishable; that is, for 
any constant c and any x, \ {proh{ViewB'{x) = a) — prob^Ss' (x) = a) ) | < 

l/|x|L 

Definition 6. Let (A,B) be an interactive proof system of program result- 
correctness with knowledge error err for problem P. We say that (A,B) is a 
computational zero-knowledge proof system of program result-correctness with 
knowledge error err for P if for each B' there exists a probabilistic machine Sb> 
(called the simulator) running in expected polynomial time such that for all x, 
the probability spaces ViewB' {x) and Sb> (x) are computationally indistinguish- 
able; that is, for any poly-size family of circuits {C„}, for any constant c and 
any x, \ prob{Cn{ViewB' (x)) = 1) - prob{Cn{SB' (x)) = 1) | < l/|a:|^. 

3 A Perfect Zero-Knowledge Protocol for QR Modulo 
Blum Integers 

We consider the problem of proof system of program result-correctness which is 
perfect zero-knowledge. Our result is a protocol for the language of quadratic 
residuosity in the case the modulus is a Blum integer [10]. 



66 



Giovanni Di Crescenzo et al. 



3.1 Blum Integers and Related Properties 

We informally state some properties about Blum integers and quadratic residu- 
osity, see [28,20] for formal descriptions and proofs. 

An integer a; is a Blum integer if and only if a; = where p and q are 

different primes satisfying p = q = 5 mod 4 and k\ and k 2 are odd integers. Let x 
be a Blum integer, and let Z* be the multiplicative group modulo x. Define 
and Z~^ to be, respectively, the sets of elements of Z* with Jacobi symbol +1 
and —1. Define the set QRx = {y £ Z*\3u G Z* : y = v? mod x} of quadratic 
residues modulo a;, and the set QNR^ = {y £ Z^^ \ £ Z* : y = v? mod a;} 

of quadratic non residues modulo a;. If a; is a Blum integer, then —1 mod x is 
a quadratic non residue with Jacobi symbol +1. This implies that on input a 
Blum integer x, it is easy to generate a random quadratic non residue in Z'^^: 
randomly select r £ Z* and output — mod x. Moreover, for this special class 
of integers we have that for any yi,y 2 £ Z%^ the product yiy 2 is a quadratic 
residue if and only if both are residues or both are non-residues. The 

quadratic residuosity assumption states that it is hard to distinguish the two 
sets QRxtQNRx and is widely used in the construction of systems for various 
cryptographic applications. The better algorithm known to distinguish QRx from 
QNRx requires the factorization of the modulus, another well-known problem 
which is believed hard. 

3.2 A Competitive Protocol for QR^, 

Competitive interactive proofs of membership for quadratic residuosity have 
been investigated in [8,24]. Our protocol is different in two aspects. Namely, 
it is a proof of program result-correctness and it is perfect zero-knowledge: thus, 
it allows the prover to show that his program returns the correct output on a 
common input y, without revealing whether y £ QRx or y £ QNRx, or any 
other additional information. 

First, we recall two simple observations. As already noticed in [8], the proof 
system of membership for QNRx given in [22], where x may be any integer, is 
competitive. Then, if a: is a Blum integer, one obtains a competitive proof sys- 
tem of membership for QRx as follows: on input v, the prover and the verifier 
run the competitive protocol for QNRx on input —v mod x. Both competi- 
tive proof of membership will be used as subprotocols in our proof of program 
result-correctness for QRx- Then, we can informally describe our protocol (A,B) 
in the following way: on input y, A uniformly generates two integers Zq, z\ such 
that exactly one has the same quadratic residuosity as the input y. Then B 
sends two random bits b,c. Now, if 6 = 0, A reveals the random coins used 
to compute ^ 0 , 2 : 1 . In this step A shows that have been correctly com- 

puted. On the other hand, if 6 = 1 then A considers the integer Zc sent be- 
fore: if Zc is a quadratic residue, then he interactively proves the statement 
‘2c £ QRx' to B, using the above competitive proof of membership; if Zc is a 
quadratic non residue, then he interactively proves the statement ‘zc £ QNRx’ 
to B, using the above competitive proof of membership. In this step A shows 
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that if zo,zi have been correctly computed, then he has a program that can 
decide whether y is a quadratic residue or not. B accepts if he is convinced from 
such proof. We denote by (C,D) the proof system of membership given in [22] 
for quadratic non-residuosity. A formal description of (A,B) is as follows. 

The Protocol (A,B) 

A: Uniformly choose a bit a and two numbers r-Q,ri G Z*; set Za = 
y X mod X and z\-a = —y x (ri-a)^ mod x. Send (zo,zi) to 

B. 

B: Uniformly choose two bits 6, c, and send (b,c) to A. 

A: Send a to B. If 6 = 0 then send xq, ri to B. Otherwise if Zc G QRx, 
set d = 1 else set d = 0; send c? to B and prove that (—1)'^ x 
mod X G QNRx (this step is executed as follows: A and B run 
the protocol (C,D) on input (—1)'^ x mod a;; where A runs C’s 
program and B runs D’s program). 

B\ If 6 = 0 check that zq, z\ are correctly constructed; if 6 = 1 check 
that the proof that (— 1)"^ x Zc mod x G QNR^ is convincing. 



It is possible to prove that (A,B) is a perfect zero-knowledge proof of program 
result-correctness with error 1/2 for QRx- Define the protocol (P,V) as the se- 
quential execution of n copies of (A,B), where V accepts if B does not reject in 
any of the n iterations. Thus, we obtain the following 

Theorem 2. The protocol (P,V) is a perfect zero-knowledge proof of program 
result-correctness (with knowledge error 0) for the language QRx 

Proof. Competitive regularity. The requirement of competitive regularity follows 
from the properties of the two subprotocols, which are also competitive, as seen 
before. 

Validity. To see that the validity requirement is satisfied, observe that we can 
construct the following extractor E (we omit here the analysis of the trivial 
cheating strategies by P'). On input y, E runs in parallel the two following al- 
gorithms, and halts when any of the two outputs first. The first algorithm is an 
exponential-time search algorithm to compute whether the input y is a quadratic 
residue modulo x or not (such an algorithm may be for instance to enumerate 
all integers z < x and see if z G and z^ mod x] y is a quadratic residue 
if and only if such a z is found; and the running time of this algorithm is at 
most |a:|^ • 2l“l, for some constant k). The second algorithm is the following: E 
runs the program of the verifier V, interacting with P', and then completely 
rewinds the prover P' and starts again, using independently and uniformly cho- 
sen random bits, until he gets two accepting conversations such that for 

some j = 1, . . . ,n, where by 6* we denote bit b sent by E in the j-th sequen- 
tial repetition of the i-th accepting conversation. If he has obtained these two 



68 



Giovanni Di Crescenzo et al. 



accepting conversations, he has obtained also a pair of integers (rQ,ri_a) such 
that r'^ X y = Za mod x, x y = -^i-a mod x and a convincing proof that 

one of Za,zi-a is a quadratic residue or non-residue modulo x. This clearly al- 
lows him to obtain the quadratic residuosity of y (for instance, if z\-a is proved 
as a quadratic residue modulo a;, then y is a quadratic residue). 

To show that the result is correct with high probability, we observe that if E 
runs the exponential-time search algorithm, then clearly his output is correct 
with probability 1. Otherwise, the probability that E returns a wrong value for 
the quadratic residuosity of y is at most the probability that a cheating P' is 
able to convince him that Zc^ is quadratic residue while it is not (or vice versa), 
and this probability is negligible from the soundness of the proof system of 
membership used. 

To show that the expected running time is properly bounded (i.e., polynomially 
proportional to the reciprocal of the acceptance probability of prover P'), we 
observe the following. We consider two cases according to whether the number 
of accepting conversations that may be possibly obtained interacting with P' 
is 1 or at least 2 (if such number is 0, we don’t need to prove anything, since 
accp'{x) = 0). Assume such number is 1. Then E will output because of the 
exponential-time algorithm since he will never obtain two accepting conversa- 
tions from P'. However, accp'{x) < since there exist exactly one value 

of random bits 6i, . . . , for which the conversation is accepting, and thus the 
expected running time is bounded by |a;|^ • 2l’^l = |x|^/accp/(a;). Now, assume 
the number of accepting conversations that may be possibly obtained interacting 
with P' is at least 2. Then E may output because of the fact that he obtained 2 
accepting conversations from P'. To see that E’s time is properly bounded even 
in this case, observe that the expected number of rewindings of P' needed to 
obtain two accepting conversation can be bounded as at most 3 jaccp' . Then we 
just observe that at each rewinding E runs in polynomial time. 

Perfect zero-knowledge. Consider a simulator S that uses at each iteration the 
standard trial and error strategy. Namely, S will prepare a pair (zq,zi) such 
that he can answer either to 6 = 0 or to 6 = 1, randomly choosing the case; 
then, he guesses V' question b with probability 1/2, and thus output in expected 
polynomial-time. To prepare the pair (zq,Zi) so that he can answer to case 
6 = 0, he randomly chooses a € {0, 1} and ro,ri G Z* and then computes Za = 
y X mod X and Zi-a = —y x (ri_a)^ mod x. The answers will be tq, ri. To 
prepare the pair {zq, Zi) so that he can answer to case 6 = 1, he randomly chooses 
a G {0,1} and ro,ri e Z* and then computes Za = (^a)^ mod x and z\-a = 
— {ri-af' mod X. In this case S knows the quadratic residuosity of Zc and thus 
can simulate the proof by P by running the simulator for the protocol for QNR^ 
on input Zc or —Zc mod x according to which is the case. Observe that the output 
of Sy' and the execution of protocol (P,V') on input y are equally distributed. In 
particular, we observe that the pair {zq,z\) is equally distributed as a randomly 
ordered pair of a random quadratic residue and a random quadratic non-residue 
for both values of 6. 
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4 An Upper Bound for Statistical ZK Proofs of 
Result-Correctness 

We give an upper bound on the class of languages having perfect/statistical zero- 
knowledge proof systems of result-correctness. Namely, we prove the following 

Theorem 3. Let L be a language. If L has a statistical zero-knowledge proof 
system of program result-correctness (with knowledge error 0), then L is in 

BppNP 

Basic idea and techniques used: We use a result by Bellare and Petrank [11] 
that investigated how much power is sufficient for the prover to execute a sta- 
tistical zero-knowledge protocol. Namely, they showed that any statistical zero- 
knowledge proof system of membership for a language L can be modified into 
a statistical zero-knowledge proof system for L, in which the prover runs in 
PPT with an NP-oracle. The same construction works also for any statistical 
zero-knowledge interactive protocol and thus also for proof systems of result- 
correctness. 

Then a sketch of our construction is the following: given a statistical zero- 
knowledge proof system of result-correctness (A,B) with knowledge error 0 for L, 
we apply the mentioned transformation to A and obtain a statistical zero- 
knowledge proof system of result-correctness (M^^,B) with negligible knowledge 
error. Then (M^^,B) has an extractor E that can decide L within a ‘properly 
bounded’ expected time (specifically, polynomially and inversely proportional to 
the acceptance probability of the possibly cheating prover P' that he is inter- 
acting with). Then one can construct a BPP'^^ algorithm D which simulates 
a polynomial number of executions of E and runs the machine to answer 
to E’s queries. Clearly, at least one of E’s executions run by D will halt; and, 
finally D returns E’s output. Details of construction and proof are omitted. 



5 Computational Zero-Knowledge Proofs of Program 
Result-Correctness 

This section studies the power of computational zero-knowledge proofs of pro- 
gram result-correctness and prove that all PSPACE-complete languages have 
such a proof under the assumption of existence of secure probabilistic encryp- 
tion schemes ([20]). A tool we use is the following theorem given in [25,9]. 

Theorem 4. [25,9] Let (A,B) be an interactive proof system of membership 

for a language L. If secure bit commitment schemes exist, then there is a com- 
putational zero-knowledge proof system of membership (P,V) for L. 

We show that: 

Theorem 5. Let L be any PSPACE-complete language. If secure bit commit- 
ment schemes exist, then (P,V) is a computational zero-knowledge proof system 
of program result-correctness (with knowledge error 0) for L. 
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We informally describe our construction. Since P SPACE is closed under com- 
plement, and using Shamir’s result [29], we can construct an interactive proof 
system (Ai,Bi) of membership to L and an interactive proof system (Ao,Bq) 
of membership to L. Now, both proof systems (Ai,Bi) and (Ao,Bq) are trans- 
formed into Arthur-Merlin proof systems with perfect completeness. A further 
step is that of padding the messages exchanged in the two above proof systems 
in some standard way, so that the number of rounds, the length of the messages 
sent by the provers and of those sent by the verifiers, are equal in both protocols. 
Let (A'j^,B'j^) and (Aq,Bq) be the two protocols resulting from such transforma- 
tions. Notice that B]^ and Bq now differ only in the accepting predicate in the 
end. Then, consider the following proof system of membership (A,B) for the lan- 
guage {0, 1}*. On input x, A computes xl{x) and sends a bit 6 = xl{x) to B. 
Then A and B run protocol (A[,,B[,). B accepts if BJ, accepts the conversation 
with A[,. The proof of this protocol’s properties is omitted. 

Open Problems Finding perfect/statistical zero-knowledge proofs of program 
result-correctness for other problems is interesting. A stronger upper bound on 
the class of languages having statistical zero-knowledge proof systems, namely 
that SZKIP C AM n co-AM, has been given in [18,2]. The proof technique 
they use does not extend to proofs of program result-correctness. Whether this 
bound can be obtained also for statistical zero-knowledge proofs of program 
result-correctness is interesting as well. 
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Abstract. We study the two-layer planarization problems that have ap- 
plications in Automatic Graph Drawing. We are searching for a two-layer 
planar subgraph of maximum weight in a given two-layer graph. Depend- 
ing on the number of layers in which the vertices can be permuted freely, 
that is, zero, one or two, different versions of the problems arise. The 
latter problem was already investigated in [11] using polyhedral combi- 
natorics. Here, we study the remaining two cases and the relationships 
between the associated polytopes. 

In particular, we investigate the polytope Vi associated with the two- 
layer planarization problem with one fixed layer. We provide an overview 
on the relationships between "Pi and the polytope Qi associated with the 
two-layer crossing minimization problem with one fixed layer, the linear 
ordering polytope, the two-layer planarization problem with zero and 
two layers hxed. We will see that all facet-defining inequalities in Qi 
are also facet-defining for Vi- Furthermore, we give some new classes of 
facet-dehning inequalities and show how the separation problems can be 
solved. First computational results are presented using a branch-and-cut 
algorithm. For the case when both layers are fixed, the two-layer pla- 
narization problem can be solved in polynomial time by a transforma- 
tion to the heaviest increasing subsequence problem. Moreover, we give 
a complete description of the associated polytope V 2 , which is useful in 
our branch-and-cut algorithm for the one-layer fixed case. 



1 Introduction 

A bipartite graph is a graph G = {A,B,E) with vertex sets A and B, called 
upper and lower layer, and an edge set E connecting a vertex in A with a vertex 
in B. There are no edges between two vertices in the same layer. A bipartite 
graph is two-layer planar G = (A, B, E) if it can be drawn in such a way that all 
the vertices in A appear on a line (the upper line), the vertices in B appear on 
the lower line, and the edges are drawn as straight lines without crossing each 
other. The difference between a planar bipartite graph and a two-layer planar 
bipartite graph is obvious. For example, the graph shown in Fig. 1 is a planar 
bipartite graph, but not a two-layer planar graph. 

Depending on the number of layers in which the permutation of the vertices 
is fixed, different problems arise: 



K.-Y. Chwa and O. H. Ibarra (Eds.): ISAAC’98, LNCS 1533, pp. 69—79, 1998. 
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Fig. 1. (a) A planar bipartite graph that is (b) not 2-layer planar 



— The permutations tt^ and ttb of both layers A and B are fixed: Given a 
two-layer graph G = {A, B, E,tta,t!'b) with weights We > 0 on the edges, 
the two-layer planarization problem (2 layers fixed) is to extract a subgraph 
G' = (A, F, 7T^, 7Tb), F C E, of maximum weight, i.e., the sum '^f.^pWe 
is maximum, which contains no crossings with respect to the given permu- 
tations tta and ttb- 

— The permutation tta of one layer A is fixed: Given a two-layer graph G = 
(A, B, E, 7T^, •) with weights u>e > 0 on the edges, the two-layer planarization 
problem (1 layer fixed) is to extract a subgraph G' = {A,B,F,tta,*), F C F, 
of maximum weight, which contains no crossings with respect to the given 
permutation t:a of the upper layer. 

— Both layers can be permuted: Given a two-layer graph G = (A, F,F, •,•) 
with weights Wg > 0 on the edges, the two-layer planarization problem (none 
layer fixed) is to extract a two-layer planar subgraph G' = (A, B, F,u,u), 
F C F, of maximum weight. 

To our knowledge, only the unweighted {wg = 1 for all e G F) two-layer 
planarization problems have been considered in the literature so far. Eades and 
Whitesides [4] showed NP-hardness for the latter two versions of the planariza- 
tion problem and showed that the two layer fixed version can be solved by 
transforming it to a longest increasing subsequence problem. The none layer 
fixed version was first mentioned in [15]. The authors introduced the problem 
in the context of graph drawing. Recently, the weighted two-layer planariza- 
tion problem has been attacked, in which the layers are allowed to be permuted 
freely [11]. The computational results are encouraging. 

Directed graphs are widely used to represent structures in many fields such 
as economics, social sciences, mathematics and computer science. A good visu- 
alization of structural information allows the reader to focus on the information 
content of the diagram. 

A common method for drawing directed graphs has been introduced by 
Sugiyama et al. [14] and Garpano [2]. In the first step, the vertices are partitioned 
into a set of k layers, and in the second step, the vertices within each layer are 
permuted in such a way that the number of crossings is small. In practice, this is 
done layerwise. Keep the permutation of one layer fix while permuting the other 
one, such that the number of crossings is reduced. We suggest an alternative 
approach for the second step. 
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Already for two-layer graphs the straight-line crossing minimization problem 
is NP-hard [6] even if one layer is fixed [5]. Exact algorithms based on branch 
and bound have been suggested by various authors (see, e.g., [9]). For k > 2, 
a vast amount of heuristics has been published in the literature (see, e.g., [14] 
and [3]). A new approach is to remove a minimal set of edges such that the re- 
maining fc-layer graph can be drawn without edge crossings. In the final drawing, 
the removed edges are reinserted. Since the insertion of each edge may produce 
many crossings, the final drawing may be far from an edge-crossing minimal 
drawing. 



4 6 8 7 5 15 14 3 2 13 12 9 1 11 




4 6 3 7 5 8 14 2 12 15 9 13 1 11 




(b) 

Fig. 2. A graph (a) drawn using fc-planarization and (b) drawn with the minimal 
number of crossings computed by the algorithm in [9] 



Figure 2(a) shows a drawing of a graph obtained by two-layer planarization, 
whereas Fig. 2(b) shows the same graph drawn with the minimal number of 
edge crossings (using the exact algorithm given in [9]). Although the drawing 
in Fig. 2(a) has 34 crossings, that is 41% more crossings than the drawing in 
Fig. 2(b) (24 crossings), the reader will not recognize this fact. This encourages 
us to study the fc-layer planarization problem. We decided to first study the case 
fc = 2 in order to learn for the general case fc > 3. 

In Sect. 2 we define the polytope Vi associated with the set of all possible 
two-layer planar subgraphs with respect to a given fixed permutation tta- We 
then point out the relationships to related poly topes. This gives us hints about 
the structure of Vi. In Sect. 3 we give a complete description of the polytope as- 
sociated with all two-layer planar subgraphs when both permutations are fixed. 
This description is useful in the algorithm for solving the two-layer planariza- 
tion problem (1 layer fixed case). Moreover, it provides a different polynomial 
time algorithm for solving the two-layer planarization problem (2 layers fixed). 
In Sect. 4, we investigate the structure of the polytope Vi- We present an irre- 
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dundant integer linear programming formulation and obtain additional classes 
of inequalities that tighten the associated LP-relaxation. In particular, besides 
some new classes of facets, we can show that all facet-defining inequalities of the 
linear ordering polytope transmit to the new polytope V\ . In order to get practi- 
cal use out of these inequalities, we have to solve the “separation problem” . This 
question will be addressed in Sect. 5, where we also discuss a branch-and-cut 
algorithm based on those results. First computational results with a branch-and- 
cut algorithm are presented in Sect. 6. In this extended abstract we omit the 
proofs for most of the theorems. 

2 The Polytope Pi and Its Related Polyhedra 

Let us consider the two-layer planarization problem with one fixed layer more 
precisely: Given a two-layer planar graph G = (A, B, E^tta,*) with a fixed 
permutation tta of the vertices in A, we are looking for a permutation ttb 
of the vertices in B, and a subset F of edges in E such that the subgraph 
G' = {A,B,F,tta,'^b) is two-layer planar under the given permutations tta 
and 7 Tb of the sets A and B, respectively. 

We introduce variables yuv for 1 < m < < |S| representing the permuta- 

tion TTg of the vertices in B. That is, = 1 iff vertex u is before vertex v in ttb 
and ijuv = 0 otherwise. We denote the (row) vector y = (yi, 2 ,yi, 3 , ■ • • ,2/l) with 
L = (^) . (Vectors are row vectors throughout the paper.) Moreover, we introduce 
variables Xe for 1 < e < |if | representing the subgraph induced by F. Variable Xe 
takes value 1 iff e S F and value 0 otherwise. For any tuple {ttb,F), where ttb 
is a permutation and F C E, we define an incidence vector ^ j^l+|£;| 

with the f-th component getting value 1 iff Cj S F and 0 if ^ F for 

i > L, and the j-th component getting value 1 if vertex u is before 

vertex v in ttb and 0 otherwise for j < L. 

Now, we can define the two-layer planar subgraph polytope 

Fi =Fi(A, F, F, 7T^, •) =conv|x^’^^’^^ | ttb is a linear ordering and 
G' = {A, B, F, TTA, 'Xb) is a two-layer planar subgraph of g| 

as the convex hull of all incidence vectors ^jj^t represent a two-layer 

planar subgraph G' = {A, B, F,7ta,ttb) with respect to the valid orderings tta 
and ttb- 

For solving the two-layer crossing minimization problem with one fixed layer, 
we consider the polytope Qi (see [9]). Again, we introduce variables j/y G {0, 1} 
representing the permutation of the vertices in B. The incidence vector x'^^ G 
has the j-th component X^^iUuv) value 1 if vertex u is before v and 0 otherwise. 
The polytope 

Qi = Qi(A, B, F, TTA, •) = conv<^ | ttb is a permutation of the vertices inF 1 
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is identical to the linear ordering polytope that has been studied in [7]. If we 
denote the points in by (y,x), where y G R^, x € we have the fol- 

lowing relationship between the two polytopes and Qi: Qi = Pi n {a; = 0}. 
This fact will lead us to investigate the hereditary property of the facet-defining 
inequalities in Qi for 

The polytope Vo associated with the two-layer planarization problem with 
two free layers (none fixed) has been introduced in [11]. Again, we denote an 
incidence vector having the i-th component x^i^e) value 1 if Xe G F 

and value 0 if Xe ^ F . 



Vo = Vo{A,B,E,»,») = conv|x^ | There exist orderings tta and ttb such that 
G' = (A, B, F, is a two-layer planar subgraph of g| = 

= conv<^ I G' = (A, B, F, •, •) is a two-layer planar subgraph of G [ 



We can use our knowledge of the studied polyhedra Q\ and Vo for our inves- 
tigation of Vi- In particular, all facet-defining inequalities of Qi and Vo are still 
valid inequalities for Vi . Moreover, we will see that all facet-defining inequalities 
of Qi are still facet-defining for Vi. 

Let us consider the two-layer planarization problem when the permutations 
of both layers are fixed. We define 



V2 = V2{A-,B,E,TTA,TrB) = conv{x^ I F C if is a two-layer planar sub- 
graph of G with respect to the orderings tta 
and 7 Tb} 

We have Vi O {y = 0} D V 2 - In the following Section we will consider the 
structure of the polytope V 2 - 



3 A Complete Description of the Polytope P 2 

In this Section we will consider the two-layer planarization problem when both 
layers are fixed. The set of all two-layer planar subgraph of G = (A, B, E, tta, t^b) 
defines an independence system Fp{G) = {E,{E \ F C E induces a two-layer 
planar graph}) on E. Let us examine the circuits and the cliques of this inde- 
pendence system. Circuits are the minimal dependent sets in {E,X) with respect 
to set inclusion. An independence set is called k-regular if each of its circuits is 
of size k. The set F C if is a clique of (F,X), ifjFj > k and all (^^^) fc-subsets 
of E are circuits of {E,X). In [12] it is shown that a maximal clique F C F in 
a fc-regular independence system (F, X) gives a facet-defining inequality, namely, 
the clique inequality 

'^Xe<k-l, ( 1 ) 

eeF 

for Px, the polytope associated with The set of circuits in our system T(G) 

is 



S = {{{p,v), {q,u)} I 7Ta(p) < TTA{q),TrB{u) < ttb{v),{p,v), {q,u) G E}. 
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Hence, 1{G) is a 2-regular independence system. The maximal cliques in T(G) are 
the maximal sets of pairwise intersecting edges (with respect to set inclusion) . We 
show, that the associated maximal clique inequalities and the trivial inequalities 
define the polytope 7^2- 

Theorem 1. The maximal clique inequalities ofX{G) together with the inequal- 
ities a;e < 1 that are not contained in any clique and the trivial inequalities 
0 < Xe for e = give a complete irredundant description of the two- 

layer planar subgraph polytope V 2 (both layers fixed). 

Proof, (sketch) To proof the claim, we build a directed graph R with a single 
source s and a single sink t where every node apart of s and t corresponds to 
an edge in E. When every node has the capacity given by the corresponding 
component of a vector x in [0, 1]'^^ than x belongs to polytope V 2 if and only 
if there is no path in R from s to t where the sum of the capacities of the nodes 
is greater than 1. Every path from s to f corresponds to a maximal clique in 
{E,I) and so a path where the sum of the capacities exceeds one corresponds 
to a violated clique inequality. 

Every vector in V 2 corresponds to a capacity function on the nodes of R such 
that there is no path from s to t where the sum of the capacities is greater than 
one. By shifting capacities in R, we can show that for every weighting of the 
edges in E and for every vector a; in 7^2, there is another vector x' in V 2 with the 
property that every component is either 0 or 1 and the sum of the weights of the 
edges whose nodes in R have capacity 1 is at least as large as the corresponding 
sum for X. Thus, we have a complete description of V2. 

Since the separation problem for the clique inequalities can be solved in 
polynomial time (see Sect. 5), this yields a polynomial time algorithm for the 
two-layer planarization problem via the Ellipsoid method. 

There is also a combinatorial algorithm for solving the problem. Eades and 
Whitesides [4] give a transformation of the unweighted two-layer planarization 
problem to the longest increasing subsequence problem. A similar transformation 
to the heaviest increasing subsequence problem works for the weighted version 
of the problem. 

Lemma 1. By transforming the two-layer planarization problem to an instance 
of the heaviest increasing subsequence problem, it can be solved in time 
0{\E\log\E\). 

Both theorems are not surprising, since there are similar results for the trace 
polytope T 2 on two sequences that has been introduced in [13] in the context 
of multiple sequence alignment. The set of circuits in the independence system 
It{G) is 

S U {{{p,u),{p,v)} I ttb{u) < ttb{v),{p,u) e E,{p,v) G E} 

U {{{P, u), {q, m)} I tta{p) < TTA{q), (p, u) G E, {q, u) G E}. 

In the following, we investigate the relations between the two-layer planar 
subgraph polytope V 2 and the trace polytope 72. 
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Lemma 2. Let G = (V, E) he a graph. There exist transformations from G to 
G' = {V',E') and G" = {V”,E") with E' ^ E ^ E" , |1/'| = \V"\ = 2\E\, and 

V 2 {G) ^ V 2 {G') ^ T 2 {G') and T 2 {G) ^ T 2 {G") ^ V 2 {G"). 

4 The Structure of the Polytope Pi 

First, we give an integer linear programming formulation for the two-layer pla- 
narization problem with one fixed layer. The notation is based on the previous 
Section. Let G = (A, B, E, tta, •) be a two-layer graph and let w G be the 
cost vector on the edges. Then, the two-layer planarization problem is to solve 

maxjwa;^ | (y,x) G Vi,y G G 

We are interested in the integer points of Vi- 

Theorem 2. The integer points of the two-layer planar subgraph polytope Vi = 
Vi{A, B, E,tta,^) are charaeterized by the following system of inequalities: 



-yuv-yvw + yuw<0 l<u<v<w<\B\ (2) 

Vuv + yvw - Vuw < ^ l<u<v<w<\B\ (3) 

0 < yuv <1 1 < u < V < w < \B\ (4) 

yuv integral 1 < u < v < w < \B\ (5) 

Vuv + X(^p^u) + <2 u < v^TTAiq) < TTA{p),ip,u),{q,v) G E (6) 

-yuv + X(^p^u) X(^q^y) < 1 u <v,tta{p) <T^A{q),{Pju),{q,v) G E (7) 

0<Xe<l 1 < e < |F;| (8) 

Xe integral 1 < e < |if| (9) 



Proof. The inequalities (2)-(5) require the variables to represent a linear or- 
dering 'Kb- Inequalities (6)-(9) are responsible for introducing no crossing with 
respect to the ordering kb given by the vector y. In particular, inequalities (6) 
and (7) link together the subgraph variables x and the linear ordering vertices y. 
A crossing between two edges (p, m) and (q,v) occurs either if KA{q) < 'X'Aip) 
and u is before v in the ordering kb given by y, or if ka{p) < 'XA{q) and v is 
before u in kb. 

Next, we address the question if the description given in Theorem 2 is tight. 

Theorem 3. The description given in Theorem 2 is an irredundant description 
of the two-layer planar subgraph polytope V\ = 'Pi(A, B, E, ka, •)■ In particular, 
the inequalities (2)-(4) and (6)-(8) are facet- defining for Vi. 

In order to prove the facet-defining property of the inequalities, it is essential 
to know the dimension of the poly tope. 



Lemma 3. The dimension of the two-layer planar subgraph polytope 
V\ = Vi{A, B, E, ka,*) is L -\- \E\, where L = (^) . 
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Proof. We know from [7] that the linear ordering polytope is full dimensional. 
For every ordering of the nodes, a two-layer graph with only one edge is two-layer 
planar. So we can easily construct a set of L -I- \E\ affinely independent vectors 
that correspond to two-layer planar graphs. 

In Sect. 2 we have seen that is closely related to the linear ordering 
poly tope Qi- The following theorem gives us the possibility to use the knowledge 
of the well-studied polytope Q\ for . 

Theorem 4. Let < cq be a facet- defining inequality of the linear ordering 
polytope Qi = Qi{A, B, E,tta, 9). Then cy^ < cq is also facet- defining for the 
two-layer planar subgraph polytope Vi = V\{A, B, E, tta, •). 

For the rest of this Section we will concentrate on new facet-defining inequal- 
ities for V\ ■ Our practical experiments have supported the need for inequalities 
containing only a;- Variables. 

We define a blocker B = {u,l,r) to be a subgraph of G = {A, B, E,tta,*) 
containing the edges (l,u) and (r,u) with tta{1) < 7TA ( r ). We use the notation 
x{B) = X(^i u)+X(^r,u)- A blocker forbids certain edges. An edge e = (j>, w) crosses 
blocker B = (uj,r) iff 7 ta(0 < '^a{p) < T^Air) and w ^ u. This fact leads to 
e-blocker inequalities which are valid and in some cases facet-defining for Vi- 
Figure 3 shows some examples of configurations leading to these inequalities. 



Fig. 3. Examples for support graphs of facet-defining e-blocker inequalities 



Theorem 5. Let Bi, . . . , Bk be a set of blockers Bi = (ui,li,ri) with Ui yf Uj 
for i,j = i yf j, and let e = (j>,w) G E be an edge which crosses all 

blockers Bi. Then, the e-blocker inequality 



is valid for Vi{A, B, E,tta,^). If U = Ij and ri = Vj for i,j = l,...,k, it is 
facet- defining for V\ {A, B, E, tta, •) ( even for k = 1). 

5 The Algorithm and Separation Routines 

The separation problem is to decide for a given vector x and a polytope V, 
whether x G V, and, if a; ^ 7^, find a vector d and a scalar do such that the 
inequality dsf^ < do is valid with respect to V and dx'^ > do. 




o ® 

(a) 



(b) 



k 




( 10 ) 
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Lemma 4. The separation problems for the inequalities (2)-(4),(6)-(8), 
and (10) can he solved in polynomial time. 

For the two-layer planarization problem with one fixed layer, we implemented 
a branch-and-cut algorithm using the ABACUS-System [10]. Because of space 
limits, we cannot describe our branch-and-cut algorithm in more detail. We use 
separation routines for the inequalities given in Lemma 4 in order to get good 
upper bounds. Moreover, we try to use some information given to us by fractional 
solutions in order to get good lower bounds. 

Our studies of the two-layer fix planarization problem is useful in two ways: 
For getting good lower bounds, we frequently use the combinatorial algorithm 
for the two-layer fix version given in Lemma 1. The upper bounds can be im- 
proved using the following strategy: In every branching step on variables in y, 
we select a variable puv and set it to either 0 or 1 . In the subproblems below this 
branching node, we have decided on a partial order for the vertices in B. For 
the partially ordered subsets, we can use the inequalities given by the complete 
description for V 2 (see Sect. 3). Next, we will see that the separation problem 
for the inequalities (I) can be solved in polynomial time. 

Theorem 6. For the maximal clique inequalities, the separation problem can be 
solved in polynomial time by computing at most \E\ shortest path problems. 

According to earlier results (e.g., [8]), we can optimize a linear objective 
function over a polytope in polynomial time if and only if we can solve the 
separation problem in polynomial time. Hence, Theorem 6 gives us a polynomial 
time algorithm for solving the two-layer planarization problem when both layers 
are fixed. 



6 Computational Results 

To test the performance of our branch-and-cut algorithm for the two-layer pla- 
narization problem (1 layer fixed), we worked with the graphs from [1] that are 
called the North DAGs. These directed acyclic graphs have 10 to 100 nodes. 
We distributed them into sets Gi with i running form 1 to 9 such that the 
set Gi holds the graphs where the number of nodes is at least lOi and at most 
lO(i-l-l) — 1. We worked on 12 randomly chosen graphs out of each of the sets Gi. 
For each of the graphs, we distributed the nodes into pairwise disjoint sets Lj 
(called layers) such that for all edges the start-node is on a layer with smaller 
index than the end-node. This can be done using topological sorting. 

After Inserting some dummy nodes, we get for each graph a number of bi- 
partite graphs that consist of the nodes on two neighboring layers and the edges 
between these nodes. For a graph with k layers, we get k—1 bipartite graphs Bi 
to Bk-i where Bi consists of the layers Li and Li+i. We used B\ as input for 
our algorithm for solving the two-layer planarization problem (none layer fixed) 
resulting in a permutation for the layers L\ and L^. Then we applied the algo- 
rithm for the problem with one fixed layer to the rest of the problems beginning 
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with i? 2 - Every optimization was stopped after 5 minutes if no optimum solution 
was found before. The following tabular shows for each set Gi the average op- 
timization time (in seconds) for each layer and the maximum time used in any 
of the graphs of the set on a Sun Ultra Sparc 2/2x200. In all the 108 graphs we 
tested, there were only 4 graphs for which a planarization-problem could not be 
solved to optimality in 5 minutes computation time. 



Nodes 


10-19 


20-29 


30-39 


40-49 


50-59 


60-69 


70-79 


80-89 


90-99 


Average 


0.17 


7.07 


19.96 


1.73 


1.37 


150.61 


12.64 


73.07 


5.83 


Maximum 


1.1 


17.0 


78.53 


14.1 


27.04 


300.16 


116.8 


300.14 


50.16 
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Abstract. This paper addresses the classical graph drawing problem 
of designing an algorithm that compntes an orthogonal representation 
with the minimum number of bends, by considering all possible planar 
embeddings of the graph. While the general problem has been shown 
to be AP-complete [7], polynomial time algorithms have been devised 
for graphs whose vertex degree is at most three [5]. We show the first 
algorithm whose time complexity is exponential only in the number of 
vertices of degree four of the input graph. This settles a problem left as 
open in [5]. Our algorithm is further extended to handle graphs with 
vertices of degree higher than four. The analysis of the algorithm is 
supported by several experiments on the structure of a large set of input 
graphs. 

1 Introduction and Overview 

Graph drawing is concerned with the design of methods for the automatic display 
of graphs so as to emphasize important features of such graphs and produce aes- 
thetically pleasing visualizations. Various graphic standards have been proposed 
to draw graphs, each standard being devoted to a specific class of applications. 
For example, orthogonal drawings, i.e. drawings where edges are represented as 
chains of horizontal and vertical segments, are widely used in applications such 
as data-base systems (E-R diagrams), software engineering (data-flow diagrams), 
and circuit design (circuit schematics) . An extensive survey on the different graph 
drawing standards and relative algorithms can be found in [4]. 

Most graph drawing algorithms are designed to work in a fixed embedding 
setting: they receive as input a graph G with information about its topology and 
produce as output a drawing of G that preserves such information. For a planar 
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Grafica ed al CAD.” 
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graph, such topological information is typically represented by its planar embed- 
ding, i.e. the circular ordering of the edges incident on the vertices. If the input 
graph is not planar, the first step of the algorithm is typically a planarization 
step that determines a planar embedding of the graph, possibly adding dummy 
vertices for crossings. The planarization step is then followed by a representa- 
tion step, that computes a drawing with the given embedding and removes the 
dummy vertices. Notice that the choice of a planar embedding can deeply affect 
the output of the drawing algorithm. Thus, it naturally rises the problem of 
designing algorithms that work in a variable embedding setting, i.e. algorithms 
that are allowed to change the planar embedding of the input graph, in order to 
optimize the output with respect to a given set of aesthetic requirements (see, 
e.g. [1,7,2,5]). 

In this paper we study the problem of computing orthogonal drawings with the 
minimum number of bends in a variable embedding setting. Orthogonal draw- 
ings are a classical field of investigation in graph drawing. A very limited sample 
of papers on orthogonal drawings includes [13,14,11,3,12]. In [7] it is shown that 
the problem of computing an orthogonal drawing with the minimum number of 
bends is NP-complete. On the other hand, polynomial time algorithms for series- 
parallel graphs and 3-planar graphs are proposed in [-5]. The algorithm in [2] 
computes an orthogonal drawing with the minimum number of bends for bicon- 
nected 4-planar graphs, by considering the set of all planar embeddings of the 
graph with a branch-and-bound technique. Since the number of such embeddings 
is in general exponential with the number of vertices of the graph, the algorithm 
can require exponential time. The research described in this paper starts from 
the following observation: based on the existence of a polynomial time solution 
for 3-planar graphs, it would be desirable to have an algorithm whose complexity 
is exponential only in the number of vertices of degree 4. This can be of partic- 
ular interest if such number is a small fraction of the total number of vertices. 
Consider for example the graph of Figure 1, that has 45 vertices, only one vertex 
of degree 4, more than 2^^ different planar embeddings, and whose orthogonal 
drawing with the minimum number of bends can be computed in polynomial 
time by an easy variant of the algorithm in [-5]. Devising a 0(c"^p(n)) algorithm 
for computing orthogonal drawings with the minimum number of bends in a 
variable embedding setting is also mentioned as an open problem in [5]. Here, 
c is a constant, is the number of vertices of degree 4 of the input graph, 
and p(n) is a polynomial in the total number of vertices of the graph. In the 
paper, we exploit a vertex expansion technique to devise a 0(6"'‘n^ logn)-time 
algorithm, that receives as input a biconnected 4-planar graph G and computes 
an orthogonal drawing of G with the minimum number of bends. The analysis 
of the algorithm is supported by several experiments on the structure of a large 
set of input graphs. The algorithm is further extended to handle biconnected 
planar graphs with vertices of degree higher than 4. Ideas related to those that 
underly our vertex expansion technique can be found in [9]. For lack of space, 
proofs have been omitted in this extended abstract and will appear in the full 
version of the paper (see also www.dia.uiiiroina3.it/~didimo/papers). 
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Fig. 1. An orthogonal drawing of a graph with 45 vertices, 1 vertex of degree 4, 
and more than 2^^ different planar embeddings. The graph has been drawn with 
the GDToolkit system (www.dia.uniroma3.it/people/gdb/wpl2/). 



2 Preliminaries 

We assume familiarity with planarity and connectivity of graphs [10]. Since we 
consider only planar graphs, we often use the term embedding instead of planar 
embedding. A graph is said to be k-planar if it is planar and its vertices have 
degree at most k, where k is an integer constant. The degree of a vertex v is 
denoted as deg{v). 

Let G be a biconnected planar graph. A split pair of G is either a separation 
pair of G or a pair of adjacent vertices. A split component of a split pair {m, w} 
is either an edge (u, w) or a maximal subgraph Guw of G such that {u, u>} is not 
a split pair of Guw Vertices u and w are the poles of the split component. Let 
(s,t) be an edge of G called reference edge. An SPQ*R-tree T of G (a simple 
variation of the SPQR-trees introduced in [6]) is a tree describing a recursive 
decomposition of G with respect to its split pairs, and it is used to synthetically 
represent all the embeddings of G with (s,f) on the external face (we always 
assume, unless stated otherwise, that (s,t) is on the external face). Nodes of T 
are of four types: S, P, Q* and R. Each node p has an associated graph called 
skeleton of p and denoted by skeleton(p). Starting from the reference edge, T is 
recursively defined as follows: Chain ease, if G consists of a simple path from s 
to t then T is a single Q*-node p whose skeleton is G itself. Series case: if G is 1- 
connected, let ci,. . . , Ck-i {k > 2) be the cutvertices of G such that no cutvertex 
has degree less than three; ci,. . . , Ck-i partition G into graphs Gi,. . . , Gfc. The 
root of T is an S'-node p. Graph skeleton{p) is the chain ei, ...e^, where edge Ci 
goes from Ci-i to Ci, cq = s and Ck = t. Parallel case: if s and t are a split pair 
for G with split components Gi,. . . , Gk {k > 2), the root of T is a P-node p. 
Graph skeleton(p) consists of k parallel edges from s to t, denoted ei,...,efc. 
Rigid case: if none of the above cases applies, let {si,ti} ,...,{sk,tk} be the 
maximal split pairs of G (fc < 1), and for i = l..k, let Gi be the union of all the 
split components of The root of T is an i?-node p. Graph skeleton(p) 

is obtained from G by replacing each subgraph Gi with edge e, from Si to ti. 
We call pertinent graph of p the graph whose decomposition tree is the subtree 
rooted at p. Also, the virtual edge of p is the edge representing the pertinent 
graph of p in the skeleton of its parent. An R-component ( S -component, P- 
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component) of G is the pertinent graph of an i?-node (S'-node, P-node) of a 
decomposition tree of G. 

Let If' be a planar embedding of G and let / be a face of G. / is a rigid face if all 
its edges belong to the skeleton of an i?-node of T. An immediate consequence 
of the properties of S, P, Q*-trees is the following. 

Property 1 . If / is a rigid face in a planar embedding of G, then the edges of / 
form a rigid face in any planar embedding of G. 

A planar drawing of a planar graph G such that all edges of G are mapped 
to polygonal chains of horizontal and vertical segments is an orthogonal drawing 
of G. A planar graph has an orthogonal drawing if and only if it is 4-planar. 
Two orthogonal drawings P and P' of G, preserving the same embedding of G, 
are shape equivalent if for each vertex ri, consecutive edges in the adjacency list 
of V form the same angle in the two drawings, and for each edge (m, v) following 
from u to V the polygonal chain representing (u,v), we have the same (possi- 
bly empty) sequence of left and right turns in the two drawings. An orthogonal 
representation iL of G is a class of shape equivalent orthogonal drawings of G. 
We say that H preserves the embedding of G when the shape equivalent or- 
thogonal drawings in the class H preserve the embedding of G. We also say 
that H is optimal within the given embedding if it has the minimum number of 
bends among all orthogonal representations of G that preserve its embedding. 
Finally, H is optimal, if it has the minimum number of bends over all the possible 
(not necessarily embedding preserving) orthogonal representations of G. Turns 
on a polygonal chain of H are called bends. We denote by b{H) the total number 
of bends of H. 

Let H be an orthogonal representation of a 3-planar graph and let / be an 
internal face of PI, such that / consists of 4 vertices and has no bends (i.e. / 
is a rectangle with a vertex at each corner). Let v he a, vertex of / such that 
deg{v) = 3 and let e' and e" be the two edges of / that share v. Edge e' (e") 
is the ingoing (outgoing) edge of u in / if the triplet e',v,e" is encountered in 
this order when walking around / in clockwise direction. Face / is said to be a 
counterclockwise (clockwise) face if for every vertex v oi f such that deg(v)=3, 
the edge e oi PI incident on v and not belonging to / forms & it j 2 angle with 
the ingoing (outgoing) edge of v (see Figure 2). 

3 Expanding Graphs and Representations 

Let G be a biconnected 4-planar graph with a given planar embedding P and 
let T be a planar drawing of G that preserves P. Let u be a vertex of P such that 
deg(v) = 4 and let 61,62,63,64 be the edges incident on v in circular clockwise 
order. We construct a planar drawing from P as follows: (1) v is replaced in P by 
a simple cycle fy, called expansion of v, that consists of vertices ui, '62, "63, '64 in 
clockwise order around /„. (2) each edge Cj = (ui, v) of P is replaced by an edge 
ii = (ui, Vi), i = 1 , . . . , 4. Let T be a drawing obtained by recursively applying 
operations (1) and (2) to P, for each vertex of P that has degree larger than 
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(a) (b) 

Fig. 2. (a) A counterclockwise face, (b) A clockwise face. 



three. See, for example, Figure 3 (b). We call the planar graph represented by F 
the expanded graph of G and denote it as G. It is univocally determined by F. 

Property 2. The expanded graph of G is a 3-planar graph. 

Property 3. The expanded graph of G has n+3n4 vertices, where n and denote 
the number of vertices and the number of degree 4 vertices of G, respectively. 

J’he class of drawings of G that are equivalent to P defines an embedding 
of G that is called the expanded embedding of F. 

Property 4- In the expanded embedding of W an expansion of a vertex of G is 
an internal face. 

A face that is the expansion of a vertex of G is also called expanded face. 

Theorem 1. Let G he an embedded biconnected A-planar graph and let G be an 
expanded graph of G . Let v be a vertex of G such that deg{v) = 4 and let fy be 
the expansion of v in G. Cycle fy is a rigid face in any planar embedding of G. 

Let G be an embedded 4-planar graph, let F be the embedding of G, let G be 
the expanded graph of G, and let F be the embedding of G. We now study the 
relationship between the set of orthogonal representations of G that preserve F 
and a particular set of orthogonal representations of G. Namely, we consider the 
set 7i~ of orthogonal representations of G such that: each orthogonal represen- 
tation H of Ti,~ preserves <F, and each face of G resulting from the expansion of 
a vertex of G is represented in iJ as a counterclockwise face. An element of Ti,~ 

is called counterclockwise orthogonal representation of G. Let be the set of 
all orthogonal representations of G that preserve F. 

Lemma 1. For any jirthogonal representation H € Phf there exists an orthog- 
onal representation H £ 7i~ such that b(H) = b{H). Also, for any orthogonal 

representation H S Ti.~ there exists an orthogonal representation H € Tic such 
that b{H) = b{H). 
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Fig. 3. (a) a vertex v and its expansion /^; (b) a drawing F and a drawing F 
obtained by recursively applying operations (1) and (2) 



Theorem 2. Let G be an embedded A-planar graph, let F be the ^embedding of G, 
let G be the expanded graph of G, and let W be the embedding of G. There exists a 
bisection £ between the set TL^ of orthogonal representations of G and the set TL~ 

of counterclockwise orthogonal representations of G. Also, for any FI G TL^ we 
have that b(H) = b{£{F[)). 

In the rest, for an orthogonal representation H € Tfip we call its image 
F[ = £{H) the expansion of FI. Also, FI = £~^{H) is the contraction of H . 

4 Optimal Orthogonal Drawings 

Our general strategy for computing optimal orthogonal drawings consists of two 
main steps. The input is a biconnected 4-planar graph G and the output is an op- 
timal orthogonal drawing of G. We use SPQ*R-trees to handle the embeddings 
of G. Since a decomposition tree of G implicitly represents the embeddings of G 
with a reference edge on the external face, for each choice of a reference edge e 
we compute an optimal orthogonal drawing F of G, such that e is on the ex- 
ternal face (Computation Step). After the execution of the Computation Step 
on each possible choice of the reference edge, we select the orthogonal drawing 
with the minimum number of bends (Selection Step). Since there are 0(n) 
possible choices of the reference edge, the overall time complexity of a drawing 
algorithm based on the above strategy is 0{n x t{n)), where t{n) is the time 
complexity of the Computation Step. 
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Now, we focus on the Computation Step. From the assumption that G is 4- 
planar and using the properties of the decomposition trees, the following lemma 
gives an upper bound on the number of embeddings of G with e on the external 
face. We denote with Ve the set of such embeddings. 

Lemma 2 . \Ve\ < 6 ”, where n is the number of vertices of G. 

Based on Lemma 2, a brute-force approach to the Computation Step is as 
follows: (i)For any embedding iF of Ve compute an optimal orthogonal draw- 
ing that preserves W by means of the O(n^logn) algorithm in [13]. (ii)Choose 
the drawing with the minimum number of bends among the computed ones. 
Hence a Computation Step based on this approach requires 0(6”n^ logn) time. 
The bound can be further reduced to 0(6"n~ ^/logn) time with the technique 
presented in [8]. However, as also pointed out in Section 1, the Computation 
Step can be executed in polynomial time if G is a 3-planar graph [5]. Thus, we 
aim at devising an alternative strategy for such step, that is exponential in the 
number of degree 4 vertices of G. Our idea is to execute the Computation Step 
as follows: Task (i): The embeddings in Ve are grouped into 0(c”*) equiva- 
lence classes. Task (ii): For each class of equivalent embeddings, an orthogonal 
drawing with the minimum number of bends is computed in polynomial time. 
Task (iii): Among the 0(c”'‘) computed orthogonal drawings, the cheapest one 
is chosen. 

Regarding Task (i), we need to define a new notion of equivalence between 
the embeddings of G. Let <Fi and W 2 be two embeddings in Ve- 'V\ and W 2 are 
4-equivalent if they have the same circular counterclockwise ordering of the edges 
around each vertex v of G such that deg{v) = 4. Hence, Ve can be partitioned 
into equivalence classes such that any two embeddings in the same class are 4- 
equivalent. A 4-embedding is a class of 4-equivalent embeddings in Ve- We denote 
with vjf^^ the set of 4-embeddings of Ve- 

Lemma 3. [T^i^^j < 6 "'*, where 714 is the number of vertices of degree 4 of G- 

The following lemma is a consequence of Lemma 3 and of a variant of the 
algorithm in [2] that generates all embeddings of a given graph. 

Lemma 4. Let G be a biconnected 4-planar graph with a given reference edge e 
and ri 4 vertices of degree 4. The set of 4- equivalent embeddings can be com- 
puted in 0 ( 6 ”^) timc- 

For a given reference edge e of G, let be a 4-embedding in vi^\ An 
orthogonal representation of G that is optimal within the embeddings in 
is called optimal within the 4-embedding and is denoted as Task (ii) 
is accomplished by computing for each a corresponding To do this 

in polynomial time, we exploit the expansion theory of Section 3 as follows. 
We consider the expanded graph G of G induced by an embedding in 
By Property 2 G is 3-planar and by Property 3 it has 0(n) vertices. In [5] 
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it is shown an algorithm that computes an optimal orthogonal representation 
of a 3-planar graph in polynomial time. First, the graph is decomposed in 
its S-, P-, and i?-components by means of an SPQ* R-tree. Then, for each S-, P-, 
and i?-component, a suitable set of orthogonal representations is computed, and 
such representations are combined to compose an optimal one. In particular, for 
each i?-node the set of orthogonal representations is computed by solving a min- 
imum cost flow problem on a flow network associated to the skeleton of the node. 
Since by Theorem 1, each expanded face in G is rigid. By adding constraints to 
the minmum cost flow problem studied in [-5] , a variant of the that algorithm can 
be devised to compute an orthogonal representation H of G such that: (a) Its 
embedding is the expanded embedding of an element in (b) Its expanded 
faces are counterclockwise, (c) It has the minimum number of bends among all 
orthogonal representations of G satisfying (a) and (b) . 

By means of techniques similar to those presented in [-5] the following can be 
proved. 

Lemma 5. PI can he computed in 0{n^logn) time. 

By applying Theorem 2 we have the following result. 

Lemma 6. The contraction of PI is an orthogonal representation of G optimal 
within . 

Lemmas 5 and 6 give rise to the following. 

Theorem 3. Let G be a biconnected 4-planar graph with n vertices and let 
he the number of vertices of G that have degree 4. There exists an algorithm 
that computes an orthogonal drawing of G with the minimum number of bends 
in 0(6'^'^ log n) time. 

The algorithm described above can be extended to compute optimal orthogo- 
nal drawings of graphs with high degree. Namely, the notion of expanded graphs 
can be readily defined also to graphs with high degree vertices. In this case, 
the expanded face /„ of a vertex v such that deg{v) = k consists of k vertices. 
Vertex v is then represented in the computed optimal orthogonal drawing as a 
rectangular face that actually corresponds to /„. By exploiting the 3-planarity 
of the expanded graph, we can state the following. 

Theorem 4. Let G be a biconnected k-planar graph with n vertices, where 
k > A is the maximum vertex degree. Let h be the number of vertices of G 
that have degree larger or equal to 4. There exists an algorithm that computes 
an orthogonal drawing of G with the minimum number of bends in 0{{k — 1)!" 
((fc — l)n)^ log((fc — l)n) time. 

We performed several experiments in order to better understand the behav- 
ior of our algorithm on a large set of graphs. We considered a test suite of more 
than 2000 graphs, with number of vertices in the range 10-200, and having a 
number of 4 degree vertices linear with the total number of vertices (Figure 4 (a)). 
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Fig. 4. Charts of the experiments: the a:-axis represents the number of vertices. 
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The graphs have been generated by a technique that has been already exploited 
for other experimental studies on orthogonal drawings [2]. The implementa- 
tion uses the GDToolkit library (www.dia.uniroma3.it/people/gdb/wpl2/). 

We observed the quantity ' ' (Figure 4 (d)), in order to measure in practice 

the factor c”'* affecting the overall time complexity of the algorithm. Also, we 

\'pW I 

measured the quantity (Figure 4 (e)), in order to estimate the effectiveness 
of our algorithm with respect to a brute force approach. 

Figure 4 (b) shows the structure of the graphs in terms of the number of 
their distinct S'-, P- and i?-nodes of the decomposition tree. The distribution of 
vertices of degree 4 in such components is depicted in Figure 4 (c). 
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Abstract. In orthogonal graph drawing, edges are represented by se- 
quences of horizontal and vertical straight line segments. For graphs of 
degree at most four, this can be achieved by embedding the graph in a 
grid. The number of bends displayed is an important criterion for layout 
quality. A well-known algorithm of Tamassia efficiently embeds a planar 
graph with fixed combinatorial embedding and vertex degree at most 
four in the grid such that the number of bends is minimum [23]. 

When given a dynamic graph, i.e. a graph that changes over time, one 
has to take into account not only the static criteria of layout quality, but 
also the effort users spent to regain familiarity with the layout. Therefore, 
consecutive layouts should compromize between quality and stability. We 
here extend Tamassia’s layout model to dynamic graphs in a way that 
allows to specify the relative importance of the number of bends vs. 
the number of changes between consecutive layouts. We also show that 
optimal layouts in the dynamic model can be computed efficiently by 
means that are very similar to the static model, namely by solving a 
minimum cost flow problem in a suitably defined network. 



1 Introduction 

One way of reducing perceptual complexity in visualized networks is to repre- 
sent all edges by alternating sequences of horizontal and vertical straight line 
segments. Technical requirements, as in VLSI layout, may be another reason for 
choosing this form of representation. Along with the generally increasing interest 
in graph drawing (see [11] for a survey), significant attention has been devoted 
to orthogonal graph layout. Many efficient algorithms have been designed, and 
many properties of orthogonal layouts (such as the number of bends, the num- 
ber of crossings, or the area needed) have been analyzed [23,4,2,20,6, and many 
more] . 

For planar graphs with vertex degree at most four, Tamassia [23] gave an 
efficient algorithm computing a grid layout with the minimum number of bends 
preserving a given combinatorial embedding of the graph. If the embedding is not 
fixed in advance, bend minimization becomes AfT^-complete [16]. The algorithm 
is well-known both because of its conceptual elegance and because the number of 
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bends seems to be an important criterion for layout quality [22]. It is based on the 
correspondence of flow in an associated network and angles in the embedding. 
The connection between angles in straight-line representations and values of 
certain linear programs was investigated by a number of authors [25,18,12,15], 
but is not one-to-one in general. Tamassia’s algorithm has been extended to 
graphs of higher degree [14], and it builds the core of an algorithm for non- 
planar graphs of arbitrary degree [24]. 

In many settings such as user interaction, software visualization, animation 
of graph algorithms, or graph queries, it is required to deal with dynamic graphs, 
i.e. graphs that change over time. When a user analyzes a graph visually, he or 
she builds a mental map that ought not change dramatically when the graph is 
modified [13]. There are several approaches to dynamic layout [10,7,19], but most 
research on orthogonal layout has focused on incremental updates, in which only 
creation of new vertices and edges is allowed [21,5]. InteractiveGiotto [3] is 
a system allowing arbitrary updates, but the layout of remaining portions of the 
graph is fixed, which generally increases the number of bends needed. Here, we 
apply the Bayesian framework of [8] to obtain a dynamic model which forms 
an explicit and controllable compromise between layout stability and layout 
quality. Interestingly, the corresponding optimization problem can still be solved 
efficiently by network flow techniques. 

This paper is organized as follows: In Sect. 2, we review the bend minimum 
grid embedding of [23]. After deriving an objective function for dynamic layout 
in Sect. 3, we show how to compute a layout accordingly in Sect. 4. Forms of user 
interaction are discussed briefly in Sect. 5, and examples are given in Sect. 6. 

2 Static Bend Minimum Layout 

A graph is said to be ^-planar, if it is planar and has vertex degree at most 
four. For the orthogonal layout of connected 4-planar graphs with fixed com- 
binatorial embedding, Tamassia [23] generates a flow network and computes a 
minimum cost flow corresponding to an orthogonal representation with the min- 
imum number of bends preserving the embedding. In this section, we review 
briefly the concept of orthogonal representation and the construction of the flow 
network associated to the graph. 

Let G be an embedded, connected, 4-planar graph. As shown in Fig. 1, an 
orthogonal representation H(G) of G = (V,E) consists of circular lists Hf = 
[ (eo, So, ao), . . . , {eddf)-!, Sda(f)-i,adG(f)-i) ] for each face / in G. Each triple 
(ei,Si,ai) of list Hf consists of an edge et G E, a string Si G {0,1}*, and an 
integer Ui G {90, 180, 270, 360}, such that ei, . . . , edG(f)-i is a clockwise (coun- 
terclockwise, if / is the outer face) traversal of the edges incident to /. Note that 
edges incident to vertices of degree one appear twice in this traversal. In Si, O’s 
and I’s represent 90 and 270 degree bends, respectively, on the right side of in 
the traversal. Finally, Oi is the size of the angle between and e-_^^ mod ddf)' 

Lemma 1 ([23]). If a b-bend orthogonal representation of an n-vertex graph is 
given, a eorresponding grid embedding ean he computed in time 0(n + b). 
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A flow network N = (W, A;b,l, u, c) consists of a directed graph {W, A) and 
functions b {supply / demand) ^ I {lower capacity), u {upper capacity), and c {cost) 
defined on the set of arcs. We here assume that all four take integer values and 
that lower capacities are nonnegative. To avoid confusion with the vertices of 
the planar graph to be embedded, the elements of W are called nodes. A flow 
X = with cost ‘-(®) ’ integer valued vector such that 

Ku!) + = 0 for all w e W 

{w' .,w)^A {w 

l{a) < Xa < u{a) for all a € A 

For the existence of a feasible flow, it is necessary that the total mass balance con- 
straint, i® satisfled. A vector x satisfying Y,(w' ,w) = 

S(uj w')^A^(w,w')i w € E^nd l{a) < Xa < u{a), a € A, is called a circulation. 
See [1] for a comprehensive treatment of network flows. 

The flow network N{G) used to compute some H{G) is constructed such that 
each unit of flow corresponds to a 90 degree angle in H{G). Therefore, the node 
set W = Wv U Wp of N{G) consists of all vertices v & V and faces / G F of G, 
i.e. Wv = V and Wp = F. Likewise, two types of arcs make up A = Ay U Ap-. 
For every occurrence of a vertex v in the traversal of a face f € F there is 
an arc {v,f) G Ay. For every edge incident to neighboring faces / and g there 
are arcs {f,g),{g,f) G Ap. Every vertex node v G Wy has supply b{v) = 4, 
while face nodes / G Wp have demand 

,/ f \ _ / — (2 • da{f) — 4) if / is an internal face 
\ — {2-dc{f) + 4:) if / is the external face 

By Euler’s formula, J2veWv total mass bal- 

ance constraint is satisfled. The lower and upper capacity of a vertex-face arc 
{v,f) G Ay are l{v,f) = 1 and u{v,f) = 4, respectively, while a face-face arc 
€ Ap has l{f,g) = 0 and infinite upper capacity. Arcs in Ay have zero 
cost, while there is an integer cost 7 > 0 on arcs in A^t-. See Fig. 1. 

Theorem 1 ([23]). // 7 > 0, there is a 1-1 correspondence between feasible 
flows in N{G) and orthogonal representations of G (up to the ordering of bends 
on each edge). Moreover, the total cost of a flow in N{G) corresponding to an 
orthogonal representation with b bends equals 7 • b. 



3 Dynamic Layout Model 

In the remainder of this paper, let G, G*^^^ = and G^^^ = {V^'A , A^'A) 

be any triple of connected, embedded, 4-planar graphs. G^‘^'> succeeds G^A in a 
sequence of graphs and is to be laid out with few bends and changes with respect 
to a given layout of G^^^. 

In order to obtain a suitable dynamic layout model we first formulate Tamas- 
sia’s static model in a slightly different way: Given a connected, embedded, 4- 
planar graph G and its associated flow network N{G), the range of variables Xa, 
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Orthogonal representation 
(this face only) 

(eo,e, 90) 

(ei,e,90) 

(62.1.90) 

(63.6.90) 

(64,0, 180) 



Fig. 1. Part of an embedded 4-planar graph G and its associated flow net- 
work N{G). Vertex nodes v € W have h{v) = 4, while the face node f G W 
has b{f) = —(2 • 5 — 4) = — 6 . Thick arcs have capacity [1,4] and cost zero, the 
others have capacity [0, oo) and cost 7 . All arcs are labeled with values of a fea- 
sible flow, and the {grey) edges of G are routed according to the corresponding 
orthogonal representation 



a S A, is {/(a), ... , w(a)} = Xa- Let X be the cartesian product of all Xa, a G A, 
and X C X he the set of feasible flow vectors. By Theorem 1, each x G X 
represents an orthogonal representation of G. The objective function of the min- 
imum cost flow problem is U{x) = = SaeAp 1 ' ^a, where 7 is a 

nonnegative integer. A random variable X with distribution 

P{X = x) = { nxGA. 

10 otherwise 



and Z = yields a constrained random held formulation of 

the layout problem, in which configurations corresponding to a minimum cost 
flow have the highest probability. The random held framework for layout models 
was introduced in [9], and it is shown in [ 8 ] how dynamic layout models com- 
promizing between readability and stability can be obtained through a Bayesian 
approach. Therefore, let V be a random variable for the layout of and let X 
be the random variable for its successor G^‘^\ The Bayesian formula gives 



P{X = x\Y = y) 



P{Y = y\X = x) ■ P{X = x) 
P{Y = v) 



for allx G X and y G y. Since y is given in advance, maximization of the left hand 
side is equivalent to maximization of the product in the nominator of the right 
hand side. But the latter consists of a prior distribution P{X = x) corresponding 
to the static layout model for G^^\ and the likelihood of a;. It was argued in [ 8 ], 
that this likelihood should be used to express criteria of stability. Since the only 
layout variables in this model are flow values directly corresponding to features 
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of the orthogonal representation, we choose to penalize deviation in these values. 
In other words, we count the number of changes in angles at vertices and bends 
at edges. Therefore, let = y\X = x) equal 



where a and (3 are nonnegative integers, and Zy\x is the normalizing constant. 
The dynamic model P{X = x\Y = y) thus compromizes between the number 
of bends in an orthogonal representation (the only criterion of readability), and 
the number of angles changed at vertices or bends (stability). This compromise 
can be biased by choosing specific values for parameters a (penalty for changing 
an angle between consecutive edges of a face), (3 (penalty for introducing or 
removing a bend on an edge) , and 7 (default penalty for bends) . 

Summarizing the above, a reasonable objective function for layouts x of G^‘^\ 
given a layout y of is 



U{-\y) is called the dynamic cost function. It is only piecewise linear. However, it 
is shown in the next section how minimization can still be performed by solving 
a minimum cost flow problem in a suitably defined network. 

4 Penalized Residual Networks 

Given a flow network N = (W, A;b,l,u,c) and a feasible flow y, the residual 
network with respect to y, Ny = (W,AU A;by,ly,Uy,Cy), is constructed by 
adding a reduction arc d for each a G 4, where d connects the same nodes as a, 
but is oriented in the opposite direction. A is the set of all reduction arcs. We 
define 

— residual supplies/demands 




0 



otherwise. 



V 






(w' ,w)^A 



{w ,w')^A 



for all w G W 
residual capacities 



ly{a) = ly{d) = 0 

Uy{a) = u{a) - ya 
Uy{d) = ya- I (a) 
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— residual costs 

Cy{a) = c{a) 

Cy(a) = -c(a) 

Since, for now, y is a feasible flow, by(w) = 0 for all w G W. Every flow z in Ny 
yields a circulation A(z) in N by setting 

A(z)a = Za- Za 

Obviously, y can be altered by adding A{z) to give a new feasible flow in N. 
When a flow should change only if a notable improvement of the cost function 
is achieved, alteration can be rendered more difficult by adding penalties to 
residual costs. The resulting network is called the penalized residual network, 
Ny = {W, A\J A] by, ly, Uy, c'y) , where c'y{a) = Cy{a) + 7Ta for all a € A U A. The 
nonnegative integers tTq represent the extra cost of changing flow along arc a. A 
flow z in the residual is called proper, if for all a S A at least one of Za and Zg, 
is zero. The following theorem summarizes the purpose of this construction. 

Theorem 2. Let y he a feasible flow in some network N . The eost of a proper 
flow z in N'y equals (c(«) ’ ~ ya) + T^a ■ \xa - ya\), where x = y + A{z) 

is a feasible flow in N . 

For networks N(G), parameters a and fl of the dynamic cost function are 
used to penalize flow alteration on arcs in Ay and Ap, respectively. Figure 2 
gives a small example. The following corollary of Theorem 2 shows that stable 
bend reduction can be carried out by computing a minimum cost flow in the 
penalized pseudo residual network. 

Corollary 1. Let = G, and y be a feasible flow in N{G). The eost 

of a proper flow z in Ny{G) equals U{x\y) — U{y), where x = y + A{z) is a 
feasible flow in N (G) . 

The flow y in N{G^^'>) defining the orthogonal representation of G*^^^ is only 
a pseudo-flow in N{G^‘^'>), since its index set no longer equals the set of net- 
work arcs. In a pseudo-flow, capacity constraints are obeyed, but flow balance 
constraints at nodes need not. However, the definition of the penalized resid- 
ual graph is easily generalized to flow vectors defined on a different index set 
by setting ya = 0 for all a not in the index set of y. Observe that residual 
demands/supplies no longer equal zero in general. 

Corollary 2. Given a grid embedding of G^^\ a grid embedding of aecord- 
ing to the dynamic layout model can he computed in 0{{n 6)^/^n-\/logn -|- b) 

time, where n is the maximum number of vertices in G^A GA ^ and b is the 
maximum number of bends in the orthogonal representations of GA and GA . 
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Fig. 2. Penalized residual network with respect to the flow in Fig. 1. Arcs aS 
A with capacities 1(a) = 0 = u(a) are not shown. Unlabeled thick arcs have 
capacity [0, 3] and cost a, unlabeled thin arcs have capacity [0, oo) and cost /3 

5 Graph Modification 

In this section, we sketch one way of maintaining the penalized residual network 
used to evaluate the dynamic cost function for a modified graph. For convenience, 
we restrict updates to the elementary modifications of inserting and deleting an 
edge, where the edge may be a bridge, i.e. incident to a vertex of degree one. An 
arbitrary number of elementary modifications in the graph may be traced before 
a new embedding is computed. The only restriction is that the underlying graph 
need to be connected, embedded and planar at all times, and 4-planar in the 
end. Therefore, more powerful update operations can be provided by combining 
elementary modifications into more complex ones. For example, vertex deletion 
is a simple sequence of edge deletions that can be carried out in one step. Note 
that modifications changing the embedding of the graph require more elaborate 
techniques and conventions for maintaining the network. 

Because of space limitations we do not go into details, but rather give an 
example for the construction of Ny(G^^'^) when an edge is inserted between 
existing vertices of See Fig. 3 and note that the residual demands/supplies 
of the two resulting copies of the face node need to be computed by restricting 
the sum of flow values to those arcs that remain incident to the node. The 
computational demand of this modification is thus proportional to the size of 
the face in G^^\ The other three elementary modifications can be performed in 
constant time. 

6 Examples 

Figure 4(a) shows a bend-minimum grid embedding of a small graph, com- 
puted according to Tamassia’s model. is modified elementary by inserting 
edge {7, 2}. A bend-minimum grid embedding of the resulting graph is shown in 
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Fig. 3. Insertion of an edge. The modified network has two copies of the face 
node, two copies of each arc corresponding to a subdivided angle (all with suit- 
ably adjusted values), and two new arcs crossing the new edge 



Fig. 4(b). Figures 4(c) and 4(d) show grid embeddings according to our dynamic 
model with a = /3 = 7 = 1, and with a = 0,/3 = 7 = 1, respectively. Changes 
with respect to the orthogonal representation of are highlighted in all three 
drawings of A vertex is rendered darker, if any angle between consecutive 
edges incident to it changed, and an edge is drawn thicker, if the sequence of its 
bends changed. 

It is readily seen that the number of changes in both Fig. 4(c) and 4(d) is 
much smaller than in Fig. 4(b). On the other hand, the number of bends is not 
minimum (12 in Fig. 4(c) and 4(d) compared to 10 in Fig. 4(b)). Thus, there is 
an obvious compromise between the number of bends and the number of changes 
with respect to the orthogonal representation of Fig. 4(a). 

A larger example is given in Fig. 5. Again, a single edge is inserted between 
two existing vertices, and embeddings obtained from the bend-minimum as well 
as from the dynamic model are shown. By allowing one additional bend, the 
dynamic model clearly succeeds in maintaining the overall structure. 
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Abstract. We consider the online list accessing problem and present a 
new family of competitive-optimal deterministic list update algorithms 
which is the largest class of such algorithms known to-date. This family, 
called Sort-by-Rank (sbr), is parametrized with a real 0 < a < 1, 
where SBr(O) is the Move-to-Front algorithm and SBr(1) is equiva- 
lent to the Timestamp algorithm. The behaviour of SBR(a) mediates 
between the eager strategy of Move-to-Front and the more conserva- 
tive behaviour of Timestamp. 

We also present a family of algorithms Sort-by-Delay (sbd) which 
is parametrized by the positive integers, where SBd(1) is Move-to- 
Front and sbd(2) is equivalent to Timestamp. In general, SBD(fc) is k- 
competitive for k >2. This is the first class of algorithms that is asymp- 
totically optimal for independent, identically distributed requests while 
each algorithm is constant-competitive. 

Empirical studies with with both generated and real-world data are also 
included. 



1 Introduction 

Consider a set {1, . . . , n} of n items that are to be stored as an unsorted linear 
list. A request for an item x is processed by searching the list for x starting 
from the head of the list. The cost of the retrieval is assumed to be the number 
of comparisons necessary to identify the item, i.e. the number of the position 
at which the item was found. After the retrieval, a list update algorithm may 
move the requested element to any position closer to the front without additional 
costs {free exchanges). Any other swap of two consecutive items incurs cost 1 
{paid exchanges). The goal of such exchanges is to minimize the cost for future 
accesses. When the list update algorithm processes the requests without knowing 
future queries, it is said to work online. 

In the sequel we will only consider requests to items and no insertions or 
deletions (static model). It is possible to extend the analyses to the dynamic 
model where the set of list items may change over time, the results carry over 
to this situation. 

* supported by a Graduiertenkolleg fellowship from the Deutsche Forschungsgemein- 
schaft (DFG), Germany. 
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List update algorithms are interesting because they are frequently used in 
practice. The most direct application is the implementation of a dictionary as 
a singly-linked list. As online algorithms constantly try to adapt to the request 
sequence, they can approximate the process that generated the requests. More- 
over, this is achieved in a computationally inexpensive way. As such, list update 
schemes have been successfully applied in data compression algorithms [1,5] Sur- 
prisingly, online algorithms also gain importance in geometric applications such 
as computing point maxima or convex hulls [9] . 

In competitive analysis, an online algorithm ALG is compared to an optimal 
offline algorithm opt. Let a be any sequence of requests to items in the list 
(we assume that all queries are successful, i.e. every requested item is actually 
in the list). Denote by ALG((t) the total cost incurred by ALG to serve cr, and 
OPt((t) the total cost incurred by an optimal offline algorithm. Then ALG is said 
to be c-competitive (having a competitive ratio of c) if there exists a constant a 
such that ALG((t) < c • OPt(ct) -|- a for all request sequences cr . 

In average case analysis, the request sequence is generated according to some 
probability model. In the independent model, a probability distributions over 
the list items is given. The requests are the realization of a sequence of indepen- 
dent, identically distributed random variables with this distribution (the requests 
are generated by some discrete memoryless source). In the markov model, the 
requests are generated by a (first order) Markov chain. In those models, the 
asymptotic expected costs of online algorithms and of optimal offline algorithms 
and their ratios are of main interest. 



1.1 On Previous Work 

The most famous list update algorithm is Move-to-Front, it has been studied 
since 1965 [12]. 

Move-to-Front (mtf): Upon a request for item x, move x to the front 
of the list. 

In their seminal work on competitive analysis [14], Sleator and Tarjan showed 
that MTF ist 2-competitive. Raghavan and Karp proved a lower bound 
of 2 — 2/ (n -I- 1) on the competitive ratio of any deterministic list update scheme 
and list size n. Hence, mtf is a competitive-optimal algorithm^. 

The important Timestamp algorithm discovered recently by Albers [2] is 
2-competitive as well. 

Timestamp (ts); Upon a request for item x, insert x in front of the 
first item y that precedes x and that was requested at most once since 
the last request for x. Do nothing if there is no such item y, or if a; has 
been requested for the first time. 

^ All algorithms with competitive ratio 2 will be called optimal. As mtf does indeed 
have a competitive ratio of 2 — 2/(n -|- 1), it is sometimes called strongly- optimal. 
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Both algorithms have optimal competitive ratio, nevertheless they are rather 
different. This difference is blurred by competitive analysis, but it is revealed 
when average case analysis is employed. In the independent model, the ratio 
between the expected cost of the mtf algorithm and a static optimal algorithm 
can be arbtrarily close to tt/ 2 = 1.5707 . . . [10]. For TS, this ratio is less than 1.34 
on any access distribution [1]. On the other hand, for Markov sources with the 
phenomenon of locality, mtf fares better. This demonstrates that Timestamp 
reacts more conservatively, and mtf more eagerly. 

Recently, a new family of list update algorithms has been introduced by El- 
Yaniv [8]. The Move-to-RECENT-Item(/c) algorithms are located “between” 
the MTF and TS strategies. 

MoVE-TO-RECENT-lTEM(fc), MRi(A:): Upon a request for item x, move x 
forward just after the last item y that is in front of x and that was 
requested at least k + 1 times since the last request for x. If there is no 
such item y of if this is the first request for x, move x to the front. 

It has been shown that MRi(l) is equivalent to the Timestamp algorithm, and 
that Move-to-Front is the limit element of that family, as k ^ oo. Moreover, 
for all A: > 1 mri(A:) is 2-competitive. Hence there is an infinite (countable) class 
of competitive-optimal list update algorithms. 

1.2 New Results 

We define a new infinite (uncountable) family of list update schemes that turn 
out to be competitive-optimal. Let <t = cri, ct 2 , . . . , (Jm be any request sequence. 
Consider any point t, 1 < t < m, in time, and define for each list item x the last 
access time wi{x,t), 

wi{x,t) = max({t' < t : at' = x}U {0}) . 

For A: > 1 define the A;th last access time Wk{x,t) to x at time t, 

Wk (x, t) = max({F < Wfe_i (x, t) ■. a f = x} U {0}) . 

Now consider for A; > 1 the rank functions 



Sk{x,t) =t- Wk{x,t) . 

Lemma 1. (a) For all t, such that each list item has been requested at least once, 
we have: When the list is maintained by Move-to-Front, the current list 
ordering is [xi, . . . , x„] iff 

Sl(xi,t) < Si(X 2 ,A) < • • • < Si{Xn,t). 

(b) For all t, such that each list item has been requested at least twice, we have: 
When the list is maintained by Timestamp, the current list ordering is 
[xi,...,x„] iff 

S2{xi,t) < S2{X2,t) < • • • < S2{Xn,t). 
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Due to lack of space, the proof is omitted. Note that this approach gives a very 
natural interpretation of the Timestamp rule. 

For a > 0, define a new rank function as the convex combination of si(x,t) 
and S 2 {x,t): 



rt{x) = rt{x,a) = {I — a) ■ si{x,t) + a ■ S 2 {x,t). 

Then we define the new family of list update algorithms. 

SORT-BY-RANK(a), SBR(q!): Upon a request for item x at time t, insert x 
just after the last item y in front of x with rt{y, t) < rt(x, t). Insert x in 
front of the list if there is no such item y, or when x is requested for the 
first time. 

Note that SBr(O) = MTF and SBr(1) is equivalent to TS modulo the handling 
of first requests. This means that SBr(1) and TS acted similarly if TS would be 
changed such that it moves an item to the front of the list upon its first request. 

Theorem 1. For 0 < a < 1, the algorithm SBr(q;) is 2-competitive. 

This result provides the first example of an uncountable class of competitive- 
optimal list update algorithms. The SBr(o) schemes use 2n timestamps for all a. 
In contrast, the mri(A:) rules need nk timestamps which is significant when con- 
sidering rules “close” to MTF. We remark that in general, the SBR(a) algorithms 
do not share the pairwise independence property that is satisfied by MTF and TS. 
Hence the proof is based on list factoring and a potential function argument 
similar to the one used in [8]. Note that for 0 < a < 1, the SBd(q;) schemes 
interpolate between MTF and TS. For a > 1, SBD(a) “extrapolates”, and in the 
limit a — > oo, the list is sorted by increasing time differences S 2 {x,t) — s\{x,t) 
between the second last and the last access to x. However, in these cases, the 
algorithms are no longer 2-competitive, as can be easily shown by example. 

The second class of list update schemes that we examine is 

SORT-BY-DELAY(fc), SBd(/c): Upon a request for item x at time t, insert x 
just after the last item y in front of x with Sk{y,t) < Sk{x,t). Insert x 
in front of the list if there is no such item y, or when x is requested for 
the first time. 

Here we have SBd(1) = MTF and SBD(2) is equivalent to TS in the above 
sense. Thus, SBD(fc) turns out to be a natural generalization of MTF and TS. 

Theorem 2. For every k >2, SBd(A:) is k- competitive. 

Lemma 2. An equivalent definition of SBD(k) is the following: 

Upon a request for item x, insert x in front of the first item y that 
precedes x and that was requested at most k—1 times since the {k — l)th 
last access to x. Insert x in front of the list if there is no such item y, 
or if X has been requested for the first time. 



Two New Families of List Update Algorithms 103 



It turns out that the SBD(fc) rules do satisfy the pairwise independence prop- 
erty. This makes it easy to achieve an average case analysis. 

Theorem 3. For any probability distribution p = (pi, . . . ,Pn), the asymptotic 
expected costs per request for SBD(k) are 






k-1 

E E 

l<i<j<n e=0 



2k -1 
i 












(Pi + Pj) 



2k-l 



For k = 1, this has been found in [12], and for /c = 2 see [1]. We compare 
this expected cost with the optimal cost M(p) incurred by an optimal static 
adversary. For pi > P 2 > • • • > Pm we have M{p) = iPi- 

Theorem 4. The family SBD(k) is asymptotically optimal, i.e. for all distribu- 
tions p we have 

EsBB{k){p) — > M{p) for k^ oo. 

In particular, the cost ratios are bounded as follows 



EsBB(i){p)/M{p) < 1.5708 
i?SBD(2)(p)/M(p) < 1.3391 
-^SBD(3)(p)/-^(p) < 1.3044 
Esbb{a){p)IM{p) < 1.2904 
Esbb{5){p)/M{p) < 1.2828. 

For fc = 1 (mtf), the ratio is bounded by 7t/2 < 1.5708, as proved in [7], and for 
k = 2 (ts), the ratio has been bounded in [1]. 

While the SBd(A:) algorithms are not competitive-optimal, they are constant- 
competitive and, under the independent model, achieve an expected search time 
that is arbirtrarily close to the optimal expected search time of an static adver- 
sary. To the best of our knowledge, all previous list update algorithms with this 
property, e.g. frequency-COUNT, SiMPLE(fc)-MTF or BATCHED(fc)-MTF have a 
competitive ratio that is linear in the length n of the list, for all A: > 2. 

The constant-competitiveness makes the SBD(fc) schemes attractive for use 
with request sequences with a low degree of locality. In particular, the empirical 
studies suggest that there is a parameter k* that suits best to the degree of 
locality of a given sequence, i.e. SBD(fc*) achieves the lowest cumulative access 
costs among all SBd(A:) schemes. 



In section 2, the Sort-by-Rank algorithms are investigated and theorem 1 
is proved. Section 3 outlines the analysis of the Sort-by-Delay rules. In sec- 
tion 4, empirical results for both families are reported. They were obtained from 
streams of independent identically distributed requests and from request se- 
quences obtained from the Calgary Compression Corpus [15], a standard bench- 
mark collection for file compression. Section 5 concludes with some remarks and 
open questions. 
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2 Analysis of the Sort-by-Rank Algorithms 



First we give an example showing that the SBR(a) rules do not in general satisfy 
the pairwise independence property. This property means that for every request 
sequence a and two arbitrary items x and y in the list L, their relative order is 
the same when a is served as their relative order in L^y (the two-element list 
holding X and y) when the projected sequence a^y is served [4]. The projeted 
sequence Uxy is defined as a after deletion of all requests for items other than x 
and y. Consider the request sequences cti = x, y, y, a, x and (T 2 = x, a, y, y, x. 
The projection onto {x,y} gives in both cases axy = x,y,y,x. But for a = 1/2 
and t = 5, we have in the first case rfyx) = 2 < 2.5 = rt{y) and in the second 
case rt{x) = 2 > 1.5 = rt{y). 

For the proof of theorem 1, we need some preliminaries on list factoring. This 
technique provides lower bounds on the cost OPt((t) of the optimal algorithm on 
the request sequence a by considering a collection of optimally maintained two- 
element lists. Let FOPT be an optimal “factored” offline algorithm that maintains 
the collection of all ( 2 ) two-element lists Lxy for all sets {a;, y} C L. Hence, each 
such list Lxy is controlled by an optimal offline algorithm that serves the request 
sequence <7xy ■ Denote by FOPTt the cost of serving at which is incurred by fopt, 
i.e. the number of two-element lists with at in the second position. Then, for 
any request sequence cr = cri , . . . , am , we have the following lower bound for the 
optimal offline cost: 

m 

^ FOPTt < OPT(ct). (1) 

t=l 

Further details are omitted due to the lack of space, they may be found in the 
full version or in [8]. 

Let L be the list maintained by SBr(q;) . When item x is currently in front of 
item y in L, We say (x, y) in L. When in the same situation y is in front of x in 
the list Lxy maintained by fopt, then (a;, y) is called an inversion. 

For each pair of items a; fy y, we define the weight function 



w{x,y) 



0 : if {x, y) is not an inversion 

1 : if (a;, y) is an inversion and y will pass x 

< in L if y is requested next . 

2 : if (a;, y) is an inversion and y will pass x 

in L iff y is now requested twice in a row 



Then we define the potential function = '^x^y w{x, y). 

The potential is always non-negative. Fix a request sequence cr = cti, . . . , am- 
The amortized cost at for SBR(a) to serve at is defined as at = SBR{a)t+d>t—d>t-i, 
where SBR(a)t is the actual cost incurred by SBR(a) in this step, and <Lt is the 
potential at time t. The proof demonstrates that there is a constant c = 2 
such that SBR(a)t < c-FOPTt for all requests t = 1, . . . , to. Then it follows from 
equation (1) that SBr(q;) is 2-competitive. 
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Lemma 3. Let y and z be two items sueh that {z,y) in L is an inversion with 
weight w{z,y) = 1. Let x be any item other than y or z. Then w{z,y) remains 1 
after a request for x is served by SBR(a). 

Let the current request be for item at = x. Without loss of generality we 
assume that each element has been requested at least once. Define the set A as 
the collection of all items z such that ( 2 ;, x) in L is not an inversion; the set B as 
the set of all z such that (z, x) in L is an inversion, and the set C as the collection 
of all elements z such that (a;, z) in L is an inversion. Note that these three sets 
are pairwise disjoint. Set A = |Al|, B = \B\ and C = \C\. 

Let the actual cost at time t be SBR(a)t. This is equal to A + B + 1, as the 
elements in front of x are exactly the items from AUS. The cost incurred in the 
optimally maintained factored list is FOPTt = A + C + 1, because exactly the 
sets {z,x} with z € A U C form a contribution to this cost. 

Now we consider the change in potential due to the action of SBR(a) when 
serving x = at- If an element z € B is passed by x, the weight of the inversion 
(z, x) in L was 1 before the move, and it is now eliminated. If an element z G B 
is not passed by x, the weight of the inversion (z, x) in L was 2 before the move, 
and is now reduced to 1. Hence there is a decrease in potential of exactly B 
due to items in B. On the other hand, due to SBr(q;)’s move, the weight of each 
inversion {x, z) with z ^ C may increase by one which results in a total increase 
of at most C. 

Next we examine the change in potential due to the action of the optimal of- 
fline algorithm FORT maintaining the factored list when serving x = at- Suppose 
that a new inversion (a;, z) in L is created due to the fact that x has passed z 
in L, but not in L^z- Then z G A before the request and we use the following 
lemma: 

Lemma 4. Let 0 < a < 1. Lf at time t in the list L, item at = x has passed an 
element z G A, then si{z,t) < S 2 {x,t)- If this request created an inversion (x, z) 
in L, and the next request is for at+i = z, then z will pass x. In other words, 
this inversion has weight w(x,z) = 1. 

Next suppose that a new inversion (x, z) in L is created due to the fact that x 
has passed z in L^z, but not in L. Then z € A before the request, and the weight 
of the inversion can only be one, since a subsequent request for x = at-f-i would 
move X to the front and remove this inversion. Hence, the weight of the new 
inversions is at most A. 

In total, the change in potential after serving the request at can be bounded 
as d>t — d>t-i < C — B + A. The amortized costs for SBR(a) to serve at are 

at = SBR(a)t + <Pt- <Pt-i 
<A+B+1 + C -B+A 
= {2-{C + 1)/(A + C + 1)) • (A + C + 1) 

< (2 - 1/n) • FOPT*. 

This completes the proof that SBR(a) is 2-competitive for 0 < a < 1. 
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3 Analysis of the Sort-by-Delay Algorithms 

The proof of SBd(/c) being ^-competitive for A: > 2 is a modification of the proof 
of theorem 1 with a different potential function; it is based on the following 
weight function 



{ 0 : if {x, y) is not an inversion 
(, : if (a;, y) is an inversion and y will pass x in L 

iff y is now requested i times, ^= 1 , . . . , fc 

The pairwise independence property of the SBD(fc) rules is an immediate 
consequence of lemma 5. 

Lemma 5. Upon a request for item x, all items that were requested at least k 
times since the {k — \)th last request for x are in front of all items that were 
requested less than k times since the (k — l)th last request for x. 

Lemma 6. Consider any point in the request sequence where there have been at 
least 2k — 1 requests to x or y. Then y precedes x if and only if the majority (i.e. 
at least k) of the 2k — 1 last requests to x or y have been for y. 

The proof of theorem 3 is based on lemma 6. Similarly, lemma 6 can be used 
to obtain explicit formulae for the expected search cost of SBD(fc) for Markov 
dependent request sequences. The first part of theorem 4 follows with a Chernoff 
bound argument, in the second part the calculations are similar to [7]. 

4 Empirical Results 

In figure 1, the simulation results of SBR(a) for the Zipf distribution {pi ocl/i) 
are presented for 0 < a < 1, and the average search cost on a list with n = 50 
items is depicted. The SBr(q;) rules provide a gradual transition from mtf to TS. 
This confirms our intuition in the average case setting where analytical results 
are difficult to achieve. 
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We conducted compression experiments with the SBd(/c) rules on the Cal- 
gary/ Canterbury Compression Corpus [15]. The list update schemes work as 
“engines” that dynamically define a statistical model for the data. Initially the 
list is sorted by decreasing access frequencies of the characters. Then the charac- 
ters are encoded sequentially. When a character has been found on position i of 
the list, its position is encoded with Elias encoding, a standard variable length 
prefix-free code for positive integers, using 1 -|- 2 [log 2 (z)J bits. 

For example, the compression performance of SBD(fc) on the file geo is de- 
picted in figure 2 for 1 < k < 200. The value k* shall describe the rule with 
the best compression among all SBd(/c) schemes. Hence the adaptive discipline 
SBD(fc*) fits best to the particular degree of locality found in the input stream. 
Sometimes however, the value k* is not sharply distinguished. Therefore, k* can 
not give a precise quantification of locality in general. 

In the following table, the results are collected and compared with the sizes of 
the original file and the results achieved by Move-to-Front compression (mtf) 
and by Frequency Count compression (fc). The latter uses as statistical 
model a static list that is ordered by decreasing access frequencies. The results 
of SBD(fc*) are significantly better than mtf compression, with typical savings 
around 15 %. However, in comparison with Frequency Count, SBd(/c*) hardly 
manages to break even. The good performance of EC on the Calgary Compression 
Corpus has already been observed in [3]. 



File 


Original 

Bytes 


EC 

Bytes 


MTF 

Bytes 


k* 


S( 

Bytes 


3RT-BY-E 
% Orig 


elay(A: 

% FC 


*) 

% MTF 


bib 


111261 


88641 


106468 


73 


89145 


80.1 


100.6 


83.7 


bookl 


768771 


516198 


644418 


153 


516966 


67.2 


100.1 


80.2 


book2 


610856 


434798 


515255 


23 


432940 


70.9 


99.6 


84.0 


geo 


102400 


85533 


107422 


19 


85687 


83.7 


100.2 


79.8 


news 


377109 


292800 


333729 


18 


289925 


76.9 


99.0 


86.9 


objl 


21504 


19304 


19351 


3 


17802 


82.8 


92.2 


92.0 


obj2 


246814 


237165 


250952 


7 


219668 


89.0 


92.6 


87.5 


paper! 


52161 


39617 


46140 


25 


39066 


73.5 


98.6 


84.7 


paper2 


82199 


56356 


69437 


54 


56539 


68.8 


100.3 


81.4 


papers 


46526 


32497 


39373 


35 


32697 


70.3 


100.6 


83.0 


paper4 


13286 


9229 


11273 


30 


9327 


70.2 


101.1 


82.7 


paper5 


11954 


8741 


10195 


18 


8759 


73.3 


100.2 


85.9 


papei'6 


38105 


28487 


32073 


13 


27320 


71.7 


95.9 


85.2 


pic 


513216 


111607 


119125 


7 


109733 


21.4 


98.3 


92.1 


progc 


39611 


30779 


39150 


13 


30414 


76.8 


98.8 


77.7 


progl 


71646 


51260 


55178 


14 


48776 


68.1 


95.2 


88.4 


progp 


49379 


35066 


40041 


9 


34903 


70.7 


99.5 


87.2 


trans 


93695 


82297 


82055 


5 


76895 


82.1 


93.4 


93.7 



108 



Frank Schulz 



We are optimistic that the possibility to adapt to the degree of locality is 
fruitful in other applications. Similarly, the SBR(a) schemes could be used for 
fine-tuning to data streams with very high level of locality, such as the ouput of 
the Burrows- Wheeler-transform [6]. 

5 Conclusion 

We introduced and analysed two families of deterministic list update algorithms, 
called SORT-BY-RANK(a) and SORT-BY-DELAY(fc). Empirical studies with se- 
quences from the Calgary Compression Corpus suggest that the parameter k* 
which denotes the rule SBd(A:*) with the best compression performance could 
be considered as a crude measure of the degree of locality of that sequence. 
The question of quantifying the locality of reference in data streams remains a 
challenge of major importance. 

It would also be interesting to experiment with non-linear rank functions, for 
example rt{x) = {si{x,t)P + S 2 {x,t)Py^P . Finally, an open question is to improve 
the tradeoff between competitive ratio and expected search cost on memoryless 
sources as exhibited by the SBd(/c) family, or to prove a lower bound. 
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Abstract. We consider the combinatorial stack-up problem which is to 
process a given sequence q of bins by removing step by step bins from 
the first s positions of the sequence onto p stack-up places. We prove 
that the Most-Frequently algorithm has best worst-case performance of 
all on-line stack-up algorithms and is, additionally, the best polynomial 
time approximation algorithm for the stack-up problem known up to 
now, although it is a simple on-line algorithm. 



1 Introduction 

In this paper, we consider the combinatorial problem of stacking up bins from 
a conveyer onto pallets. This problem basically appears in so-called stack-up 
systems which play an important role in delivery industry and warehouses. Cus- 
tomers order a large amount of articles, which have to be packed into bins for 
delivery. This work is usually done in so-called pick-to-belt orderpicking systems, 
see [dK94,LLKS93]. Typically, not all articles ordered by one customer fit into a 
single bin, so each order is divided into several bins. For delivery, it is necessary 
that all bins belonging to one order are placed onto the same pallet, and bins 
for different orders are placed onto different pallets. Bins arrive the stack-up 
system on the main conveyer of the orderpicking system. At the end of the main 
conveyer they enter a storage conveyer. The bins are picked-up by stacker cranes 
from the storage conveyer and moved onto pallets, which are located at so-called 
stack-up places. Automatic driven vehicles take full pallets from stack-up places, 
put them onto trucks, and bring new empty pallets to the stack-up places, see 
also [RW97c]. 

Logistic experience over 10 years lead to such high flexible conveyer systems 
in delivery industry. So we do not intend to modify the architecture of existing 
systems, but try to develop a model that captures important parameters of stack- 
up systems, and to develop efficient and provable good algorithms to control 
them. Today, real stack-up systems are controlled by fuzzy logic. Since almost 
all decisions have to be done at a time where only a part of the complete sequence 
is known, we focus our work on on-line algorithms. 



K.-Y. Chwa and O. H. Ibarra (Eds.): ISAAC’98, LNCS 1533, pp. 109—118, 1998. 
(c) Springer-Verlag Berlin Heidelberg 1998 
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From a theoretical point of view, we are given a sequence of pairwise distinct 
bins q = (6i, . . . , 6„) and two positive integers s and p. Each bin bi of q is destined 
for one pallet, where the number of bins for a pallet is arbitrary. The bins have 
to be removed step by step from q. If a bin is removed from the sequence, then 
all bins to the right are shifted one position to the left. The position of a removed 
bin must not be greater than s, and after each removal of a bin there must be at 
most p pallets open. A pallet t is called open, if at least one bin for t is already 
removed from sequence q, but at least one bin for t is still contained in the 
current sequence. Integer s corresponds to the capacity of the storage conveyer, 
and integer p corresponds to the number of stack-up places, because each open 
pallet occupies one stack-up place. A sequence q is called an (s,p)-sequence if it 
can be processed with a storage capacity of s bins and p stack-up places. 

The following results are already known about the stack-up problem. 
In [RW97c] it is shown that the stack-up decision problem is NP-complete [GJ79], 
but belongs to NL if the storage capacity s or the number of stack-up places p 
is fixed. In [RW97b] the performances of several on-line algorithms are com- 
pared with optimal off-line solutions by a competitive analysis [MMS88]. The 
Most-Frequently algorithm - or MF algorithm for short - processes each (s,p)- 
sequence with a storage capacity of {p -\-l)s — p bins and p stack-up places, or 
with a storage capacity of s bins and at most p ■ (log 2 (s) -I- 2) stack-up places, 
respectively. In [RW97a], a polynomial time off-line approximation algorithm for 
the processing of (s,p)-sequences with a storage capacity of s ■ [log 2 (p-l- 1)] and 
p-\-l stack-up places is introduced. 

In contrast to the performance analysis given in [RW97b], we show in this 
paper that the MF algorithm is even an optimal on-line stack-up algorithm. Fur- 
thermore, we do not only measure the performance in terms of one objective, but 
we also relax both resources, the storage capacity as well as the number of stack- 
up places. We look at on-line stack-up algorithms that take a storage capacity 
of s -I- c bins to process (s,p)-sequences, where c is any nonnegative integer. We 
give a lower bound on the number of stack-up places each such algorithm takes, 
and prove an upper bound of p-\- p ■ log 2 (^;:|:j- -I- 1) -I- 1 the MF algorithm takes. 
Furthermore, we show that the MF algorithm processes each (s,p)-sequence q 
with a storage capacity of 2s bins and 2p stack-up places. Therefore, the MF 
algorithm is the best polynomial time approximation algorithm for the stack-up 
problem known up to now that approximates both objectives within a small 
factor. We emphasize that this approximation can be achieved by a very simple 
on-line algorithm. So Most-Frequently seems to be the right algorithm to put 
into practice. 

2 Preliminaries 

Throughout the paper s and p are positive integers, and c is a nonnegative 
integer. We consider sequences q = (6i,...,6„) of n pairwise distinct bins. A 
sequence q' = (6q, . . . ,bij^) is a subsequence of g if 1 < < • • • < ik-i < 

ik < n. The position j of a bin bi^ in sequence q' = {bi^ is denoted by 
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pos{q' ,bi-). If X is the set of bins removed from q to obtain q' then q' is also 
denoted by q — X. 

Each bin 6 of g is associated with a pallet symbol plt(b) which is in general 
some positive integer. We say bin b is destined for pallet plt{b). The labeling of 
the pallets is insignificant, because we only need to know whether or not two bins 
are destined for the same pallet. The set of all pallets of the bins in sequence q 
is denoted by plts{q) :={ pit (b) \ b G q }. 

Suppose we remove step by step some bins from a sequence q = (6i, . . . , 6„). 
The successive removal of k bins yields k subsequences 

90 = 9, qi = q- {bji}, q 2 = q- {bj^,bj^}, ..., = g- 

of q. Each pair (g, g^) is called configuration. A pallet t is defined to be open in g^ 
if and only if some bin for pallet t is already removed from g, but sequence g^ still 
contains at least one bin for pallet t. The set of all open pallets in configuration 
(g, gi) is denoted by open{{q, qf)) . 

The removal of some bin b from subsequence qi is called {s , p) -transformation 
step, if 

pos{qi,b)<s, \open{{q,qi))\ <p and \open{{q,q^ - {b}))\ < p. 

A sequence of (s,p)-transformation steps that transforms sequence g into some 
subsequence qi of g is called {s,p) -transformation of g into qi . The integers s and p 
represent the used storage capacity and the used number of stack-up places. An 
(s,p)-transformation of g into some empty sequence is called {s,p) -processing 
of g. If such an (s,p)-processing exists for sequence g, then g is called an {s,p)~ 
sequence. The stack-up problem is to decide whether there is an (s,p)-processing 
for a given sequence g and given positive integers s and p. 

It is sometimes convenient to use pallet identifications instead of bin identi- 
fications. So we will write g = {plt{bi), . . . , plt{bn)) instead of g = (6i, . . . , fo„). 

Example 1. Consider sequence g = (1, 1, 2, 3, 2, 3, 2, 3, 2, 1). The table below 
shows a (3, 2)-processing of sequence g. The underlined bin is the bin which 
will be removed in the next transformation step. The columns marked Si and ji 
will be explained later. 



i 


qi = 


open{{q,qi)) 


Si 


ji 


0 


(1,1, 2, 3, 2, 3, 2, 3, 2,1) 


0 


1.1,2 


4 


1 


(1,1, 3, 2, 3, 2, 3, 2,1) 


{2} 


1.1,3 


5 


2 


(1,1, 2, 3, 2, 3, 2,1) 


{2,3} 


1,1,2 


6 


3 


(1,1, 3, 2, 3, 2,1) 


{2,3} 


1.1,3 


7 


4 


(1.1, 2, 3, 2,1) 


{2,3} 


1,1,2 


8 


5 


(1, 1,3,2, 1) 


{2,3} 


1,1,3 


9 


6 


(1.1, 2,1) 


{2} 


1.1,2 


10 


7 


(1,1,1) 


0 


1.1,1 


11 


8 


(1,1) 


{1} 


1,1 


11 


9 


(1) 


{1} 


1 


11 


10 


0 


0 
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Consider an (s,p)-transformation of sequence q into some sequence q' . Let S 
denote the sequence consisting of the first s bins of g', and let bj be the first bin 
of the last |g'| — s bins of q' . Then we can uniquely specify configuration {q,q') 
by some triple (q,S,j). Set S is called the storage. If q' contains less than or 
equal to s bins then j is defined by |g| + 1, see also example 1. 

We consider algorithms where a sequence q and a storage capacity s is given to 
the input, while the number of places p and an (s,p)-processing of q is computed 
by the algorithm. The discussion in [RW97b] allows us to make the following 
assumptions about the processing of sequences q with a storage capacity of s 
bins. Let (g, S', j) be a configuration. 

1. The bins in storage S that are destined for already open pallets will be 
removed automatically from S. 

2. If storage S contains all bins for a pallet t, and no bin for an open pallet, 
then t will be opened automatically by removing the first bin for t from S. 

Our stack-up algorithms will only be asked for a decision if each bin in the 
storage is not destined for an open pallet, and the storage does not contain all 
bins for some pallet. Such configurations are called decision configurations with 
respect to s. In a decision configuration (g, S, j) the stack-up algorithm decides 
which pallet t is opened next by removing the first bin b destined for pallet t 
from the storage. On-line algorithms only know the bins of g up to position 
J — 1, but additionally the number of bins for each pallet. Thus, it is possible to 
recognize configurations in that on-line algorithms have to open another pallet. 
In example I the first two configurations are decision configurations with respect 
to s = 3. 

3 The Most-Frequently Algorithm 

For a sequence of bins g and a pallet t let 

h{q,t) := |{ 6 e g : plt{b) = t }| 

be the number of bins for pallet t in g. The MF algorithm opens in a decision 
configuration (g, S,j) one of the pallets t € plts{S) for which h{S,t) is maximal. 
If two or more pallets have the same number of bins in storage S then ties will 
be broken by any deterministic rule. 

For a position j let cut{q,j) be the set of all pallets t such that the first bin 
for pallet t is on a position less than j in sequence g and the last bin for pallet t 
is on a position greater than or equal to j in sequence g. 

Consider a processing of sequence g by the MF algorithm with storage ca- 
pacity s. Assume during this processing of g we get I decision configurations 
(g, Si,ji), . . . , (g, Si^ji) in which the MF algorithm opens the pallets ti, . . . , ti, 
respectively. Then we define 

/imin(g, s) := min{ ..., h{Si,ti), s }. 
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If during the processing of q no decision configuration is reached, then 
hmin(9, s) = s by definition. Please note, that in this special case the processing 
takes only one stack-up place, because in each configuration the storage contains 
a bin for some already open pallet, or it contains all bins for some pallet. 



Lemma 1. For each (s,p) -sequence q and each integer c> 0 it holds 



P > 



c+l 

hmin S -f c) 



Proof. Consider a processing of an (s,p)-sequence q by the MF algorithm with 
storage capacity s-l-c. If no decision configuration is reached, h^in{q, s+c) = s-|-c 
by definition, and the result follows. Let (q,S,j) be a decision configuration in 
which the MF algorithm opens a pallet t, then the s -I- c bins in storage S 
are destined for at least |~ 1 pallets, because by the definition of the MF 

algorithm each pallet has at most h{S,t) bins in S. Since {q,S,j) is a decision 
configuration, we get 



\cut{q,j)\ > 



~ s c ~ 
h{S,t) ■ 



Let A be an algorithm that yields an (s,p)-processing of sequence q. If A 
reaches the configuration (g, S',j) for some S' and position j as above, then 



\open{{q,S',j))\ > 



c 

h{S, t) ’ 



because S' has only a capacity of s bins. If h{S,t) divides c then there are 
at least s bins in g on a position less than j which are destined for pallets in 
cut{q,j). In this case, at least + 1 places are necessary to continue the 

processing by A. This strengthen the inequality to 



|open((g,S",j))| > 



■ c -I- 1 ■ 
h{S, t) ■ 



Since p > \ open{{q, S' , j))\ and since there is at least one decision configuration 
{q,Si,ji) in which the MF algorithm will open a pallet ti such that h{Si,ti) = 
hminiq, s + c), the result follows. □ 



Theorem 1. For given integers s, p > 0 and c > 0, the MF algorithm computes 
an (s + c,p') -processing of any {s,p)-sequence q for some 



Proof. Consider a processing of g by the MF algorithm with storage capacity 
s + c. If no decision configuration is reached, the result follows because p' = 1. 
Let (g. S', j) be a decision configuration in which a pallet t is opened and where 



' s ■ p ~ 



- 2 . 



|open((g,S, j))| = p + 



c+l 
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Let T = open{{q, S', j))U{t}. By the definition of stack-up algorithms, the pallets 
of T are opened in decision configurations. Therefore, for each of the p+ \ — 1 
pallets in T there are at least s -I- c) bins in g on a position less than j. 

Let (g, S', j) be a corresponding configuration during an (s,p)-processing of g 
for some S' and position j as above. By lemma 1, we know that hmin(g, s -I- c) > 
Since ^ there are either p — 1 pallets of T open in con- 

figuration (g, S', j) and another pallet of T is opened in the next transformation 
step, or there are p pallets of T open in (g. S' ,j), respectively. The storage dur- 
ing the next transformation steps contains always a bin for an open pallet until 
the last bin for some of the open pallets is removed from g. Since all these open 
pallets are in T, and are therefore also open during the processing by the MF 
algorithm, the MF algorithm removes the last bin for an open pallet during the 
next transformation steps, too. After that there are only p + — 2 pallets 

open and the processing continues. □ 

Corollary 1. The MF algorithm takes at most 2p stack- up places and a storage 
capacity of 2s bins to process {s,p)-sequences. 

Proof. Follows immediately by theorem 1 for c = s — 1. □ 

4 Lower Bounds for On-Line Stack-Up Algorithms 

For TO pallets ti, . . . ,tm let [ti , . . . , tm] be a sequence of to bins (6i , . . . , 6m) such 
that plt{bi) = ti for i = 1, . . . , m. For two sequences of bins g = (6i, . . . , 6„) and 
g' = (6^, . . . , b'^,) let g o g' = (6i, . . . , 6„, 6^, . . . , b'^,). For a positive integer k and 
a sequence of bins g let 

fc times 

q = q o go ••• o q . 

For example, if g = (1, 2, 3) and g' = (4, 5, 6) then qoq' = (1,2, 3, 4, 5, 6) and g^ = 
(1,2,3, 1,2, 3,1, 2,3). 

Theorem 2. For each on-line algorithm A, each integers s, p > 0 and c > 0 
there are {s^p)- sequences q such that A needs at least p -\- m stack-up places to 
process q with a storage capacity o/s-|-c bins, where Xi < s for the following 
definition of Xi 

^fi = l 

■ ic+l + Yf~Xxj)'] ifi>l. 

Proof. Let g = gi o g( o . . . o gm-i-i o q!^^i such that for i = 1, . . . , to and some 
t' € plts{qi) 

f o [^ 1 , 2 ]^' o • • • o if Xi divides s -|- c 

1 ° ' °[ti o [ti otherwise 

and 

T - {t'i}- 
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Each Qi consists of exactly s-|-c bins, i.e., yi = s + c— -Xi. Let additionally 

qm+i = o • • • o 

and Qjn+l ~ [^m-l-1,1 ) • • ■ ) tl J • • ■ ) tjn] ■ 

We assume that all pallets tij are pairwise distinct. There is one such sequence q 
for each choice of the m pallets and each can be processed with a 

storage capacity of s bins and p stack-up places by opening the pallets in the 
order defined by the bins in gj. ° ° Qm+i- 

Let ti^i be the pallet for which there is a bin in sequence qt but no bin in 
sequence q[. Then by definition . . . , ... is the order of pallets de- 

fined by the bins in sequence q[. Consider the first decision configuration (g, S, k) 
after all bins for the pallets in plts{q[_i) have been removed from q. Then stor- 
age S contains bins for the pallets and the first bins of sequence qi. 

Since < s by assumption, and since the Xi are monotone increasing, 

it holds X]j=i ^3 < ^ for each i = 1, . . . , m, or equivalently 

i-l 

Xi + \ < S — Xj . 

i=i 

Therefore, there are Xi bins for pallet in the storage of decision configuration 
(q,S,k), and at least one further bin for pallet ti^ 2 - After all bins for pallet ti^i 
have been removed from qi, there are Xi bins for pallet ti^2 in the storage of 
the next decision configuration, and at least one further bin for pallet ti^^. Then 
pallet ti^2 is opened and all bins for ti^2 are removed from qi. In this way the 
processing continues until all bins for pallet have been removed from qi. 

Then at least one bin for pallet ti^i+i is in the storage of the next decision 
configuration, which can be removed. After all bins for pallet have been 

removed step by step from qi, at least one bin for pallet has come into the 
storage. In this way the processing continues, and all pallets of plts{qi) which 
are also in plts{q'i) can be processed without to open any of the pallets t'l, . . . , t'i. 
To see this, consider the following two cases. 

If {p+\) ■ Xi < s -|- c, then p ■ Xi bins have been removed from qi by the above 
processing. Since 

^ i-l i-l 

Xi > - -I- 1 -I- Xj ^ p-Xi—{c+l) > Xj, 

^ i=i i=i 

there are at least s — Xj > s + c+l-p-Xi bins of sequence qi in storage S in 
decision configuration (g, S, k). Thus, after p-Xi bins have been removed from qi, 
at least the first bin b of g' has come into the storage. 

Otherwise, if (p -I- 1) • x, > s -I- c, then all pallets of plts{qi) except ti^i have 
been opened by the above processing, and since X]j=i < s — 1 the first bin b 
of g' has come into the storage. Since the pallets have been opened in the order 
defined by the bins in sequence g', the first bins in g' are destined for open 
pallets. After their removal the next pallets can be opened. 
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After all bins for the pallets in •plts{q'^) have been removed from g, at least 
one bin for pallet tm+i.i is in the storage of the next decision configuration, 
because < s by assumption. Therefore, all bins for can be re- 

moved one after the other. After that all bins for tm+1,2 can be removed one 
after the other. In this way the processing continues until all bins for the pal- 
lets tm+1,1, ■ ■ ■ ,tm+i,p are removed from q. At the end of the processing the 
pallets are opened one after the other. 

Let A be an arbitrary deterministic on-line algorithm with a storage capacity 
of s -|- c bins. Algorithm A opens the first pallet in some decision configuration 
{q,Si,ji), where ji is the position of the first bin of q[. Algorithm A does not 
know the pallet missing in q[. Without loss of generality, we can assume 
that t'l is the pallet opened in configuration {q, Si,j\). Otherwise, we redefine q'^ 
by using the first pallet t opened by algorithm A for t\ . Since A is a deterministic 
on-line algorithm and | gi | = s -I- c, pallet t opened by algorithm A is uniquely 
specified by sequence q\ . 

The next decision configuration is (q, 5'2,j2), where j’2 is the position of the 
first bin of 52 • Here again, in configuration (g, ^2, J2) algorithm A does not know 
the pallet t'2 missing in Thus, we can assume that t'2 is the pallet opened in 
configuration (g, S'2, J2)i otherwise we redefine g^. Generally, in decision configu- 
ration (g, where ji is the position of the first bin in g', algorithm A does 

not know the pallet t' missing in g'. So, we can redefine sequence g step by step 
such that A opens in the first m decision configurations the pallets t[, ... ,tm- 

By the definition of gm-i-i, algorithm A needs p additional stack-up places 
and thus altogether p + m stack-up places to process (s,p)-sequence g with a 
storage capacity of s -I- c bins. □ 

5 An Optimal On-Line Stack-Up Algorithm 

For some position j and a pallet t G cut{q,j) let h{q,j, t) be the number of bins 
for pallet t which are on a position less than j in g. 

Lemma 2. Let q be an (s,p) -sequence, let (g, Sc,j) be any decision configuration 
with respect to s c for some given integer c > 0, and let {q,S,j) be a corre- 
sponding configuration of an (s,p) -processing of q. LetTc = open{{q,Sc,j)), and 
T = open{{q,S,j)). Then it holds 

-(c+ V h{q,j,t)) < ma.x{h{q,j,t)}. 

teTc-T 

Proof. Omitted due to space restrictions. □ 

Theorem 3. For each integers s, p > 0, and c > 0 the MF algorithm needs at 
mostp-\-l stack-up places to process an (s,p) -sequence q with a storage capacity 
0/ s -I- c bins, where Vj < * for the following definition of yi 

ifi = l 

r^(c + E-=i%)l */*>!■ 



Vi = 
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Proof. Consider some configuration (g, Sc,j) during a processing of q by the MF 
algorithm with a storage capacity of s-|-c bins such that \open{{q, S'c, j))| is max- 
imal. Let {q,S,j) be a corresponding configuration during an (s,p)-processing 
of q. Let Tc = open{{q, ScJ)), T = open{{q, S,j)), and Tc~T = {ti, . . . ,b}. Fur- 
thermore, let (g, S'c,i, ji), . . . , (< 7 , Sc,i,ji) be the decision configurations in which 
the pallets ti, . . . respectively, are opened by the MF algorithm, where ji < 

. . . < ji- Let Tc,i, 1 < i < I, denote the set of open pallets in configuration 
(<?, Sc,i,ji), and let Ti denote the set of open pallets in a corresponding configu- 
ration (q,Si,ji) during the (s,p)-processing of q. Then it holds 

> max { h{qji,t) }, 

tfc J- i J- c,i 

because the MF algorithm opens always a pallet with the most number of bins 
in the storage. This holds also for pallet ti in configuration {q, Sc,i,ji). And by 
lemma 2 it holds 

max { h{q,j,,t) } > -(c-h V h{q,ji,t)). 

teTi-Tc i V \ ^ ' / 

’ teTc,i-Ti 

For Z > z > A: > 1 it holds h{q,ji,tk) > h{q,jk, tk) by definition. Furthermore, 
none of the pallets t G {ti, . . . , ti_i} is open in any configuration (q, 5",z'), 
i' < ji, during the (s,p)-processing of q, because such a pallet t € open{{q, S' , i')) 
would remain open up to configuration (q, S,j), and therefore pallet t has to be 
in T. This contradicts the assumption that the pallets are not in 

open{{q, S, j)). Thus, we get 

1 1 

X! H9,ji,'t)) > -(^c+^h{q,jk,tk)y 

^ teT^,i-Ti ^ k=l 

Since h{q,ji,U) is an integer, we get h{q,ji,ti) > \^{c+Yjjk=i K<1, 3k,tk)y\- The 

maximum I such that X)i=i h{q,ji^ti) < s is an upper bound on the number of 
pallets which are open in configuration (q,Sc,j) and not open in configuration 
{q,SJ). By lemma 1 it holds h{q,ji,ti) > [^1. □ 

Corollary 2. Most-Frequently is an optimal on-line stack-up algorithm up to at 
most one stack-up place. 

Proof. Let Xi and pi be as defined in theorem 2 and 3, respectively. In theo- 
rem 2 we have seen, that for any on-line stack-up algorithm A there are (s,p)- 
sequences q such that A takes at least p -\- m stack-up places to process q with 
a storage capacity of s -I- c bins, where m is the largest integer that satisfies 
< s- theorem 3 we have seen, that the MF algorithm takes at most 
p -\- 1 stack-up places to process (s,p)-sequences with a storage capacity of s -I- c 
bins, where I is the largest integer that satisfies Ui < s. It can be shown 

that Hi < Xi < Ui+i for z > 1. Therefore, m and I differ at most by one. 
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Theorem 4. The number of stack-up places the MF algorithm takes to process 
{s,p) -sequences with storage capacity s -\- c is less than 

P + P- log2 + l) + 1. 

Proof. Consider the proof of theorem 3. There we have seen that 



h{q,ji,U) > 



1 

-(c+ . 

^ fc=i 



1-1 ^ 

It can be shown that (c+ 1) • (2 p — 1) < X^i=i ti) for each I > 1. That 

means, if s < (c+l)-(2 p — 1), then it holds s < ji, Since 

s < (c+l)-(2V-l) p . iog 2 + 1 < 

the maximum number of pallets which are open in some configuration (g, Sc,j) 
during the processing of q by the MF algorithm and not open in configuration 
(g, S', j) during an (s,p)-processing is less thanp-log 2 (^;^+l)+l. In configuration 
(g, Sc,j) there are at most p pallets open that are open in (g, S,j), too. □ 
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Abstract. Consider the execution of a parallel application that dynam- 
ically generates parallel jobs with specified resource requirements during 
its execution. We assume that there is not sufficient knowledge about the 
running times and the number of jobs generated in order to precompute 
a schedule for such applications. Rather, the scheduling decisions have 
to be made on-line during runtime based on incomplete information. We 
present several on-line scheduling algorithms for various interconnection 
topologies that use some a priori information about the job running times 
or guarantee a good competitive ratio that depends on the runtime ratio 
of all generated jobs. All algorithms presented in this paper have optimal 
competitive ratio up to small additive constants. 



1 Introduction 

The efficient operation of parallel computing systems requires the best possible 
use of the resources that a system provides. In order to achieve an effective 
utilization of a parallel machine a smart coordination of the resource demands 
of all currently operating applications is necessary. Consequently, the task of a 
scheduler is to cleverly assign the resources, most prominently the processors, to 
the jobs being processed. For the case of sequential jobs, i.e., jobs that require 
exactly one processor for execution, the involved scheduling problems have been 
studied intensively for decades [2] . But in many situations the problem arises to 
find a schedule for a set of parallel jobs [4,5,2]. 

The model studied in this paper assumes that each parallel job demands a 
fixed number of processors or a specified sub-system of a certain size and topology 
(depending on the underlying structure of the parallel machine considered) for 
its execution. It is not possible to run a parallel job on fewer processors than 
requested, and additional processors will not decrease the running time. This 
reflects the common practice that the decision on the number of processors is 
made before a job is passed to the scheduler based on other resource requirements 
like memory, disk-space, or communication intensity. The processors must be 
allocated exclusively to a job throughout its execution, and a job cannot be 
preempted or restarted later. This is a reasonable assumption because of the 



K.-Y. Chwa and O. H. Ibarra (Eds.): ISAAC’98, LNCS 1533, pp. 119—129, 1998. 
(c) Springer-Verlag Berlin Heidelberg 1998 



120 Stefan Bischof and Ernst W. Mayr 



large overhead for these activities on parallel machines. Furthermore, there may 
be precedence constraints between the jobs. A job can only be executed if all of 
its predecessors have already completed execution. Most frequently, precedence 
constraints arise from data dependencies such that a job needs the complete 
input produced by other jobs before it can start computation. 

We are concerned with on-line scheduling throughout this paper to capture 
the fact that complete a priori information about a job system is rarely avail- 
able. However, it has been shown [6,9] that the worst-case performance of any 
deterministic or randomized on-line algorithm for scheduling parallel job systems 
with precedence constraints and arbitrary running times of the jobs is rather dis- 
mal, even if the precedence constraints between the jobs are known in advance. 
Therefore, we study the case that there is some a priori knowledge about the 
execution times of the individual jobs but the dependencies are unknown to the 
scheduler. 

Three different gradations for this additional knowledge are studied in this 
paper. The first model of runtime restrictions requires that all job running times 
are equal and that this fact is known to the on-line scheduler. We give a level- 
oriented on-line algorithm for this problem that repeatedly schedules a set of 
available jobs using Bin Packing and collects all jobs that arrive during a phase 
for execution in the next phase. We show that this algorithm is 2.7-competitive 
if the First Fit heuristic is used. Due to a lower bound of 2.691 for every 
deterministic on-line scheduler (cf. [1]), our algorithm is almost optimal. 

We then explore the entire bandwidth between unit and arbitrary execution 
times and capture the variation of the individual job running times by a charac- 
teristic parameter that we call runtime ratio (the quotient of the longest and 
shortest running time) . Our second model postulates that the runtime ratio of a 
job system is reasonably small and that the on-line scheduler knows the shortest 
execution time (but not the runtime ratio itself). A family of job systems with 
runtime ratio Tr > 2 is given that bounds the competitive ratio of any deter- 
ministic on-line scheduler by (Tr -I- 1) /2 from below. We note that the structure 
of the dependency graph is an out-forest in all of our lower bound proofs. This 
bound remains valid even if the scheduler knows the actual runtime ratio in 
advance. An on-line scheduler designated RRR (Restricted Runtime Ratio) for 
parallel systems supporting arbitrary allocations is described, and we demon- 
strate a competitive ratio of Tr/2 -|- 4 for this algorithm for any job system 
with runtime ratio < Tr. Therefore, the RRR algorithm is nearly optimal up to 
a small additive constant. The assumption that the shortest execution time is 
known to the on-line scheduler can be dropped without much loss of competitive 
performance. In fact, the modified algorithm for this third model is Tr/2 -|- 5.5 
competitive (see [1] for details). 

The remainder of this paper is organized as follows. In Section 2 we introduce 
our scheduling model, some notation and definitions. We then discuss previous 
and related work on on-line scheduling of parallel jobs in Section 3. Section 4 
presents nearly optimal on-line schedulers for jobs with unit execution time, 
whereas in Section 5 we study job systems where the ratio of the running times 
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of the longest and shortest job is bounded. Again, we describe and analyze on-line 
scheduling algorithms that are optimal up to small additive constants. Finally, 
we present our conclusions in Section 6. 

2 Preliminaries 

Let N denote the number of processors of the parallel computer-system at hand. 
A (parallel) job system is a non-empty set of jobs JT” = { Ji, J 2 , . ■ ■ , Jm} where 
each job specifies the type and size of the sub-system that is necessary for its 
execution together with precedence-constraints among the jobs in given as 
a partial order ^ on If Jq ^ Jf, then cannot be scheduled for execution 
before Ja is completed. A task is a job that requires one processor for execution. 

A schedule for a job system is an assignment of the jobs to processors 

and start-times such that: 

— each job is executed on a sub-system of appropriate type and size, 

— all precedence-constraints are obeyed, 

— each processor executes at most one job at any time, 

— jobs are executed non-preemptively and without restarts. 

The interconnection topology of the parallel computer-system may impose seri- 
ous restrictions on the job types that can be executed efficiently on a particular 
machine. On a hypercube, for example, it is reasonable to execute jobs only on 
subcubes of a certain dimension rather than on an arbitrary subset of the pro- 
cessors. On the other hand, a number of interconnection networks do not restrict 
the allocation of processors to parallel jobs. Therefore, the various types of in- 
terconnection networks have to be treated separately. In this paper we study the 
complete model which assumes that a job Ja requests Ua processors {1 < ria < N) 
for execution and any subset of processors of size ria may be allocated. Results 
for hypercubes can be found in [1]. 

It is always possible to transform a job system (f7, into a directed acyclic 
graph D = {J,E) with (Ja^Jb) £ E ^ Ja Jb- Removing all transitive edges 
from D we obtain the dependency graph induced by (J7, ^). We call two jobs Ja 
and Jb dependent if Ja -< Jb or Jb ^ Ja, and independent otherwise. We shall use 
the terms dependency and precedence- constraint interchangeably in this paper. 
The length of a path in the dependency graph induced by is defined as the 

sum of the running times of the jobs along this path. A path is called critical if its 
length is maximum among all paths in the dependency graph induced by (J7, ^). 
A job is available if all predecessors of this job have completed execution. An 
on-line scheduling algorithm is only aware of available jobs and has no knowledge 
about their successors. We assume that the on-line scheduler receives knowledge 
about a job as soon as the job becomes available. This event, however, may 
depend on earlier scheduling decisions. 

The efficiency of a schedule at any time t is the number of busy processors 
at time t divided by N. In general, the running time of a job is also unknown 
to the on-line scheduler and can only be determined by executing a job and 
measuring the time until its completion. In Section 4, though, we study the case 
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of unit execution times and therefore restrict the on-line model there to the case 
of unknown precedence-constraints. 

Throughout the paper we use the notations shown in Table 1 (cf. [9,6]) for a 
given job system {J 

Our goal is to generate schedules with minimum makespan, i.e. to minimize 
the completion time of the job finishing last. We evaluate the performance of our 
on-line scheduling algorithms by means of competitive analysis. A deterministic 
on-line algorithm Alg is called c-competitive if Talg ^ cTopt for all job systems 
and arbitrary N. The infimum of the values c G [1, oo] for which this inequality 
holds is called the competitive ratio of Alg. The competitive ratio clearly is 
a worst-case measure. It is intended to compare the performance of different 
on-line algorithms that solve the same problem, since it is in general impossible 
to compute an optimal solution without complete knowledge of the problem 
instance. An optimal on-line algorithm is one with a best possible competitive 
ratio. 



3 Previous and Related Work 

Extensive work on non-preemptive on-line scheduling of parallel jobs with or 
without dependencies was done by Feldmann, Kao, Sgall and Teng [6,9,7]. 
However, these results for general parallel job systems are bad news for users of 
parallel computers since they show that no deterministic on-line scheduler for N 
processors can have competitive ratio better than N. That is, the competitive 
ratio is asymptotically unbounded, and even randomization cannot improve this 
unsatisfactory situation substantially. 

One possibility to improve the performance is to restrict the maximum job 
size to XN processors, 0 < A < 1. Given this restriction it has been shown that 
the GREEDY algorithm is optimal for the complete model with competitive ratio 
1-1- Setting A = 1/2, for example, yields a 3-competitive algorithm. Another 
possible alternative is the use of virtualization. This means that a parallel job Jq 
which requests Uq processors is executed on a smaller number of processors n/ 
by the use of simulation techniques with a predetermined increase in running 
time. Under the assumption of proportional slowdown (the running time of a 
job is enlarged by the factor ria/n/) it can be shown that there is an optimal 
on-line scheduler for the complete model with competitive ratio 2 + <l>, where 
= (-\/5 — l)/2 is the golden ratio. The two approaches just described can 



Table 1. Frequently used notations 



2opt 


Length of an optimal off-line schedule for {J, 


Talg 


Length of a schedule for {J, generated by Algorithm Alg 


Tmax 


Maximal length of any path in the dependency graph induced by (J, 


^min 


Minimal running time of any job in J 


^max 


Maximal running time of any job in U 


Tcol 


Total time of a schedule for (Y, when the efficiency is less then 

a, 0 < a < 1 
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be combined to yield an optimal on-line scheduler with competitive ratio 2 -|- 
complete model. 

Both approaches, though, have a severe drawback that arises due to the 
memory requirements of parallel jobs. Restricting the maximum size of a job to 
XN processors can thus severely restrict the problem size that can be solved on a 
particular machine. This is often unacceptable in practice because solving large 
problems is the main reason for the use of parallel computers besides solving 
problems fast. Virtualization may be impossible or prohibitively expensive if 
such memory limitations exist. 

The job systems used in the lower bound proofs in [6,9] for the general case 
reveal an unbounded ratio of the running times of the longest and shortest 
job. Therefore, we think it necessary to study the influence of the individual 
running times on the competitive ratio of on-line schedulers for our scheduling 
problem. To gain insight into this relationship it is only natural to start with unit 
execution times as is done in Section 4. It turns out that the problem becomes 
manageable with small constant competitive ratio even if nothing is known about 
the precedence constraints. 

To fill the gap between these two extremes — totally unrelated running times 
versus unit execution times — we identify the runtime ratio as the distinctive 
parameter of a job system for the achievable competitive ratio. Our results for 
the proposed on-line schedulers in Section 5 demonstrate a smooth, linear tran- 
sition of the competitive ratio from the case of unit execution times to unrelated 
execution times that is governed by the runtime ratio. 



4 Jobs with Unit Execution Time 



In this section, we restrict our model to the case where all jobs have the same 
execution time. When the dependency graph is known to the scheduler this prob- 
lem has been intensively studied by Garey, Graham, Johnson and Yao [8]. 
We show that similar results hold in an on-line environment, where a job is 
available only if all its predecessors have completed execution. 

The Level algorithm (see Fig. 1) collects all jobs that are available from 
the beginning. Since available jobs are independent we can easily transform the 
problem of scheduling these jobs to the Bin Packing problem: the size of a 
job divided by N is just the size of an item to be packed, and the time-steps 
of the schedule correspond to the bins (see [3] for a survey on Bin Packing). 
Let Pack be an arbitrary Bin Packing heuristic. We parameterize the Level 
algorithm with PACK to express the fact that a schedule for a set of independent 
jobs is generated according to Pack. Thereafter, the available jobs are executed 
as given by this schedule. Any jobs that become available during this execution 
phase are collected by the algorithm. After the termination of all jobs of the 
first level a new schedule for all available jobs is computed and executed. This 
process repeats until there are no more jobs to be scheduled. 
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Algorithm Level (Pack): 

begin 

while not all Jobs are finished do 
begin 

j\ \= { J ^ \ J is available }; // next EPT level 

schedule all Jobs in A according to Pack; 
wait until all scheduled Jobs are finished; 

end; 

end. 



Fig. 1. The Level (Pack) algorithm 

We now show that the First-Fit (FF) Bin-Packing heuristic is a good choice 
for Pack. FF considers all partially filled bins as possible destinations for the 
item to be packed. An item is placed into the first (lowest indexed) bin into 
which it will fit. If no such bin exists, a previously empty bin is opened and the 
item is placed into this bin. 

Theorem 1. Level(FF) is 2.7 -competitive. 

The proof of this theorem uses the following results from [8] involving a piecewise 
linear weighting function W : [0, 1] — > [0,8/5]. 

Lemma 1. Let B denote a set of items with total size < 1. Then 

^ W(size(6)) < E. 

bGB 

Theorem 2. If L is a list of items with sizes < 1, then 

FF(L) < ^ lT(size(a;)) -I- 1, 

xEL 

where FF(L) denotes the number of bins used by First-Fit to pack L. Now we 
are ready to prove Theorem 1: 

Proof. Let be a job system with unit execution time and arbitrary dependen- 
cies. We define 

W{J) = y] W(size(j)). 

Thus VF(j7) is the total weight of all job sizes. Let I be the number of levels of 
the job system. For 1 < f < / let C/i be the set of jobs of each level. By Theorem 2 
we can upper bound the length of the partial schedule for each level i, 1 < i < I, 
generated by Level(FF): 



Tlevel(FF)(C,) < wm + 1. 

We can think of an optimal packing of fl with the dependencies removed as a 
partition of into J* sets each of which has total size < 1. Applying Lemma 1 
yields VF(J') < Together with the fact that the length of the optimal 
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schedule for J without dependencies cannot be longer than the length of the 
optimal schedule for J we conclude: 

i i 

21:.evel(FF) = ^ ^ ^Level(FF)(^») < E {W{U,) + 1) = W{J) + / < 1.7 Topt + 1 . 

i—1 i—1 

Since I = Tmax < Topt, the result follows. □ 

Note that the results of this section remain valid if we assume a 1-dimensional 
array of length N as interconnection topology instead of using the complete 
model, since the Bin Packing algorithms assign consecutive processors to the 
jobs and the assignments in different time-steps are independent from each other. 



5 Parallel Job Systems with Restricted Runtime Ratio 

We have shown in the preceding section that on-line scheduling of parallel jobs 
with unit execution time and precedence-constraints is possible with small con- 
stant competitive ratio. On the other hand, if execution times are arbitrary, there 
exists no on-line scheduler with acceptable worst-case performance. It is only nat- 
ural to explore the case that job runtimes are restricted by some criterion other 
than unit execution time in order to achieve a respectable competitive ratio. For 
a set of jobs 17 we therefore define the runtime ratio RR = RR(l7) := tmax/imin- 

In this section we study the problem of on-line scheduling parallel job sys- 
tems with dependencies where the runtime ratio is bounded from above by a 
parameter Tr > 1 which is not known to the on-line scheduler. This problem 
often arises in practice when upper and lower bounds for the running time of 
a job are known in advance but the actual running time is unknown. This sit- 
uation also makes clear that the parameter Tr cannot be used as additional 
information for scheduling decisions by the on-line scheduler and is therefore 
not part of the problem instance. Indeed, our results show that this knowledge 
is not necessary for the on-line scheduler to achieve a near optimal competitive 
ratio that depends only on Tr. 

First, we give a lower bound of (Tr + I)/2 for the asymptotic competitive 
ratio of any deterministic on-line scheduler for this problem. For simplicity we 
normalize the running time of the shortest job to 1 . The job system used in this 
lower bound argument is very simple and consists of N layers with two tasks and 
one parallel job of size N on each layer. The parallel job depends on one of the 
tasks on the same layer and is predecessor of both tasks of the following layer. 
The task scheduled first by the on-line scheduler is assigned running time Tr 
and the remaining task runs for 1 unit of time and is predecessor of the parallel 
job. Clearly, the makespan of any schedule generated by an on-line scheduler is 
at least iV(TR -I- 1). If Tr is sufficiently large (e.g., Tr > 2), the optimal solution 
first schedules the critical path which has length 2N followed by the tasks of 
length Tr in parallel. 
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The competitive ratio of any deterministic on-line scheduler is thus lower 
bounded by 



N{Tr + 1) ^ Tn + l 

‘IN + AT— *-oo 2 



Algorithm RRR 
begin 

while Li ^ 0 do schedule a big job exclusively; 

while not all Jobs are finished do 

begin 

while L2 7^ 0 do schedule small jobs greedily; 
if Li 7^ 0 then 

if a big job can be scheduled then do it; 
elsif a. > 1/2 then 

wait for a scheduled job to finish; 
else / / start of a delay phase 
begin 

collect small jobs that become available during the next 2 units of time; 
schedule those jobs greedily and then wait for all scheduled jobs to finish; 
while Li 7^ 0 do schedule a big job exclusively; 

end; 

else wait for next available job; 

end; 

end. 



Fig. 2. The RRR algorithm 



We now describe an algorithm designated RRR (see Fig. 2) that achieves 
competitive ratio Tr/ 2 -|- 4. A key feature of this algorithm is the distinction 
between big jobs that request more than half of the total number of processors 
and small jobs with size < [7V/2J. Let a := a{t) denote the efficiency at time t. 
The RRR algorithm tries to keep the efficiency at least 1/2 whenever possible. 
There are two reasons that hinder the RRR algorithm from achieving this goal. 
First, there might be no job available and second, there might be not enough 
processors available to schedule a big job. Therefore, the RRR algorithm must 
prevent big jobs from being delayed too long in order to bound the fraction of the 
total schedule length with low efficiency. This is done by occasionally stopping 
to schedule small jobs, if all big jobs request more processors than currently 
available and the efficiency is below 1/2. 

We present two versions of the RRR algorithm. The first one assumes 
that tmin is a known quantity. Again, we normalize the running time of the 
shortest job to 1 and a unit of time refers to this normalized time quantum. In 
the second version we remove this assumption and employ an adaptive waiting- 
strategy to maintain a comparable competitive ratio. The RRR algorithm main- 
tains two sets, Li and L 2 , containing the available big respectively small jobs. 
We assume that any job that becomes available is immediately inserted into the 
appropriate set, and we will not state this activity explicitly in the pseudo-code 
description of the RRR algorithm. 

Theorem 3. The RRR algorithm is {Tn/2 + 4) -competitive for any job system 
{J, and RR(l7) < Tr. 
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Proof. We partition the schedule generated by the RRR algorithm into 3 differ- 
ent kinds of phases: 

1. Efficiency is at least 1/2. 

2. Efficiency is below 1/2 and there is no job available. 

3. Efficiency is below 1/2 and the algorithm waits for the termination of all 
jobs. 

We refer to the third type as a delay phase and denote the total time of each kind 
by T^i/ 2 j Tnojob , and Tdeiay respectively. The total time of the RRR schedule 
that is spent in phases of type 1 and 2 can easily be bounded by 3 Topt , because 
we have T>i /2 < 2 Topt by a straightforward area-argument and Tnojob < ?max < 
Topt by Graham’s critical path argument. 

It remains to show that Tdeiay < {Tn/2 + l)Topt- We define a delayed job 
as a big job that was available at the beginning of a delay phase. Let ti denote 
the start time of delay phase i. First, we bound the length of a delay phase 
by Tr + 2. If no small jobs become available during the first two units of time 
after the beginning of a delay phase, no more jobs are scheduled until all currently 
running jobs terminate. Since the running time of any job is no more than Tr , 
such a delay phase lasts at most time Tr. On the other hand, if small jobs 
become available during the first two units of time, then these are collected and 
scheduled greedily at time = ti + 2 (resp. t® < -|- 2 if all jobs running at 

time ti terminate before two units of time have elapsed) in addition to those jobs 
still running at time tf . If the total size of these small jobs is no more than the 
number of idle processors at time if, they can be scheduled immediately. Clearly, 
the length of a delay phase is bounded by Tr -|- 2 in this case. Should the total 
size of the small jobs exceed the number of idle processors at time tf we can 
schedule enough small jobs to raise the efficiency above 1/2 as long as small jobs 
that were collected during the interval [ti,tf] are available. The time-span while 
the efficiency is at least 1/2 is, of course, a phase of type 1 and not part of the 
delay phase. Clearly, the length of the second part of a delay phase is bounded 
by Tr and therefore the length of a delay phase is always bounded by Tr + 2. 

Let d denote the number of delay phases in a schedule generated by the RRR 
algorithm. We distinguish two cases: 

1. d = 1: We have to show that the optimal solution needs at least time 2. 
This follows immediately from the fact that each delayed job must have a 
predecessor in the job system because otherwise it would have been scheduled 
earlier. 

2. d > 1: This case will be proven by constructing a chain of jobs in the 
dependency graph with total execution time at least 2d. From that we 
have Topt > 2d and together with Tdeiay < d{Tn + 2) the claim follows. 

The construction of this chain proceeds as follows: Starting with an arbitrary 
delayed job that is scheduled after delay phase d we observe that there must be a 
small job that is ancestor of this delayed job and is scheduled immediately after 
the delayed jobs of delay phase d — I (i.e. without having a small job as direct 
predecessor that is itself scheduled after the delayed jobs of delay phase d — 1) 
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because otherwise this delayed job would have been scheduled earlier. We add 
such a small job at the front of the chain. To augment the chain, we state the 
possibilities for the direct predecessor of a small job that is scheduled by the 
RRR algorithm immediately after the delayed jobs of delay phase i: 

Type 1: Delayed job of delay phase i or big job that is successor 
of a delayed job of delay phase i, 

Type 2: Small job collected during delay phase i, 

Type 3: Small job running from the beginning of delay phase i. 

This is due to the fact that the RRR algorithm schedules all small jobs that are 
available by time before the delayed jobs of delay phase i. 

We continue the construction inductively according to these three possibili- 
ties. If there is a direct predecessor of Type 1 of the small job that is currently 
head of the list, we can repeat the initial construction step of the chain and add 
a delayed job and its small ancestor at the front of the chain. When there is no 
direct predecessor of Type 1 but a direct predecessor of Type 2, we add 2 more 
jobs at the front of the chain: the Type 2 job and a direct predecessor of this job 
that was running at the beginning of the delay phase during which this Type 2 
job was collected. Finally, if there is only a direct predecessor of Type 3, we add 
this job at the front of the chain. The inductive construction stops as soon as 
the head of the chain is a small job that is scheduled before the delayed jobs of 
the first delay phase. 

To complete the proof, we show that the total execution time of the jobs 
along this chain is at least 2d. The construction of the chain starts with 2 jobs, 
a delayed job and its small ancestor. Since the minimum running time of any 
job is 1, these 2 jobs need at least 2 units of time for execution in any schedule. 
If the construction proceeds by adding a Type 1 job, the same argument applies. 
Continuing with a Type 2 job means that again 2 more jobs with were added 
to the chain. If a Type 3 job is encountered, we know that this job must have 
execution time at least 2 because it is direct predecessor of a small job that is 
scheduled immediately after the delayed jobs of the delay phase the Type 3 job 
belongs to. Thus, for each delay phase in the schedule generated by the RRR 
algorithm, the above construction adds jobs with total execution time at least 2 
to the chain. □ 



The assumption that fmin is known to the RRR algorithm can be dropped 
by employing an adaptive waiting strategy without much loss in competitive 
performance: 

Theorem 4. The RRR_Adaptive algorithm is {Tr/ 2 + 5. 5) -competitive for 
any job system (jT, and RR(f7) < Tr. 



Proof. Omitted due to space restrictions (see [1]). 



□ 
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6 Conclusion 

We have presented and analyzed several on-line scheduling algorithms for par- 
allel job systems. It has become evident that runtime restrictions improve the 
competitive performance achievable by on-line schedulers. Therefore, if enough a 
priori knowledge on job running times is available to bound the runtime ratio of 
a job system, our schedulers can guarantee reasonable utilization of the parallel 
system. But even without any such knowledge the RRR_Adaptive algorithm 
produces schedules that are almost best possible from a worst-case point of view. 
All on-line algorithms considered in this paper are computationally simple, and 
thus the scheduling overhead involved can safely be neglected, provided that the 
system has suitable means to deliver the necessary load information. 
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Abstract. We consider the problem of testing the roundness of a man- 
ufactured object using the finger probing model of Cole and Yap [1]. 
When the object being tested is a disk and it’s center is known, we de- 
scribe a procedure which uses 0{n) probes and 0{n) computation time. 
(Here n = |l/g|, where q is the quality of the object.) When the center of 
the object is not known, a procedure using 0(n) probes and 0(n log n) 
computation time is described. When the object being tested is a cylin- 
der of length I, a procedure is described which uses 0{lin?) probes and 
0{ln^ log In) computation time. Lower bounds are also given which show 
that these procedures are optimal in terms of the number of probes used. 



1 Introduction 

The field of metrology is concerned with measuring the quality of manufactured 
objects. A basic task in metrology is that of determining whether a given manu- 
factured object is of acceptable quality. Usually this involves probing the surface 
of the object using a measuring device such as a coordinate measuring machine 
to get a set S of sample points, and then verifying, algorithmically, how well S 
approximates an ideal object. 

A special case of this problem is determining whether an object is round, or 
circle like. For our purposes, an object / is good if the boundary of / can be 
contained in an annulus of inner radius 1 — e and outer radius 1 + e, for some 
quality parameter e > 0, and is had otherwise. See Fig. 1 for examples of good 
and bad objects. We call this problem the roundness classification problem. 

Little research has been done on probing strategies for the roundness classi- 
fication problem. A notable exception is the work by Mehlhorn, Shermer, and 
Yap [4], in which a probing strategy for manufactured disks is coupled with a 
roundness testing algorithm. Unfortunately, the procedure described in [4] relies 
on the assumption that the object / is convex. It is usually not the case that the 
manufacturing process can guarantee this. 

In this paper we describe strategies for testing the roundness of manufactured 
disks and cylinders. We use the finger probing model of Cole and Yap [1]. In this 
model, the measurement device can identify a point in the interior of / and can 
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Fig. 1. Examples of good and bad objects. 



probe along any ray originating outside of /, i.e., determine the first point on the 
ray which intersects the boundary of I . The finger probing model is a reasonable 
abstract model of a coordinate measuring machine [5] . 

This work extends the results of [4] in several ways. The assumption that the 
object / is convex is replaced by a much weaker assumption related to visibility. 
Using this assumption, we give a procedure for testing the roundness of a man- 
ufactured disk I using 0(|l/qual(/)|) probes and 0(|l/qual(/)| log |l/qual(/)|) 
computation time. Here |qual(/)| measures how far the object / is from the 
boundary between good and bad. For testing the roundness of a manufac- 
tured cylinder J we describe a procedure that uses 0(^/qual( J)^) probes and 
0(Z/qual( J)^ log(Z/qual( J)^)) computation time, where I is the length of J. We 
also give lower bounds which show that our procedures are optimal, up to con- 
stant factors, in terms of the number of probes used. 

The remainder of the paper is organized as follows: Section 2 introduces 
definitions and notation used throughout the remainder of the paper. Section 3 
describes procedures for testing the quality of manufactured disks. Section 4 
presents a procedure for testing the quality of manufactured cylinders. Section 5 
gives lower bounds on the number of probes needed to solve these problems. 



2 Definitions, Notation, and Assnmptions 

In this section, we introduce definitions and notation used throughout the re- 
mainder of this paper, and state the assumptions we make on the object being 
tested. For the most part, notation and definitions are consistent with [4]. 

For a point p, we use the notation x(p), y(p), and z(p) to denote the x, y, 
and 2 ; coordinates of p, respectively. The letter O is used to denote the origin 
of the coordinate system. We use the notation dist(a, 6) to denote Euclidean 
distance between two objects. When a and b are sets of points, dist(a,6) is the 
minimum distance between all pairs of points in a and b. The angle formed by 
three points a, b, and c, is denoted by Zo6c, and we always mean the smaller 
angle unless stated otherwise. 
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A planar object I is defined to be any compact simply connected subset of 
the plane, with boundary denoted by bd(/). For a point p, we use R(p,I) and 
r(p, I) to denote the maximal and minimal distance, respectively, from p to a 



point in bd(/). I.e., 

R{p,I) = max{dist(p,p') : p S bd(/)} (1) 

r{p,I) = min{dist(p,p') : p' S bd(/)} . (2) 

For a point p, let 

qual(p, I) = min{r(p, /) - (1 - e), (1 + e) - i?(p, /)} (3) 

qual(/) = max{qual(p, /) : p G IR^} . (4) 



Any point c/ with qual(c/, I) = qual(/) is called a center of I. The value qual(/) 
is called the quality of the object I. An object / with qual(J) > 0 is good while 
an object / with qual(/) < 0 is bad. A procedure which determines whether a 
planar object is good or bad is called a roundness classification procedure. 

In order to have a testing procedure which is always correct and which termi- 
nates, it is necessary to make some assumptions about the object I being tested. 
The following assumption made in [4] is referred to as the minimum quality 
assumption, and refers to the fact that the manufacturing process can guaran- 
tee that manufactured objects have a minimum quality (although perhaps not 
enough to satisfy our roundness criterion). 

Assumption 1. i?(c/,7) < 1 -I- (5 and r(c/, J) > 1 — i5, for some constant 0 < 
S < 1/21, i.e., the boundary of / is contained in an annulus of inner radius 1 — i5 
and outer radius 1 -I- i5. 

The minimum quality assumption alone is not sufficient. If the object under 
consideration contains oddly shaped recesses, then it may be the case that these 
recesses cannot be found using finger probes. We say that an object I is star- 
shaped if there exists a point k £ I such that for any point p € I, the line segment 
joining k and p is a subset of I. We call the set of all points with this property 
the kernel of I. The following assumption ensures that all points in bd(/) can 
be probed by directing probes close to the center of I. 

Assumption 2. / is a star-shaped object, and its kernel contains all points p 
such that dist(c/,p) < a, for some constant 1 — 6 > a > 2d. 

We observe that our assumptions are weaker than those in [4]. 

Observation 1. The set of convex objects satisfying Assumption 1 is strictly 
contained in the set of objects satisfying Assumptions 1 and 2. 

3 Testing Disks 

3.1 The Simplified Procedure 

In this section we describe a simplified testing procedure which assumes that we 
know the object being tested is centered at the origin, O. The motivation for 
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describing this simplified procedure is pedagogical; it is a simple example which 
helps in understanding the full procedure. 

Our testing procedure tests the roundness of an object / by taking a set S of 
probes at uniform intervals directed at the origin. We use the notation probe(n,p) 
to denote the set of points obtained by taking n probes directed at the point p 
in directions 2-k jn, . . . — jn. The procedure repeatedly doubles the 

size of the sample until either (1) a set of sample points is found which cannot be 
covered by an annulus of inner radius 1 — e and outer radius 1 + e, in which case / 
is rejected, or (2) the set of sample points can be covered by an annulus with 
inner radius sufficiently larger than 1 — e and outer radius sufficiently smaller 
than 1 + e, in which case we can be sure that / is a good object. 



Procedure 1 Tests the roundness of the object / centered at the origin. 

1: r ^ 1 
2: i? ^ 1 
3: n <— no 
4: A ^ f{n) 

5: repeat 

6: S ^ probe(n, O) 

7: if 3p G S : dist(p, O) > 1 + e or dist(p, O) < 1 — e then 

8: return REJECT 

9: end if 

10: r <— 1 — e + Z\ 

11: R^l + e-A 

12: n <— 2n 

13: Z\ ^ fin) 

14: until Vp G 5 : dist(p, O) < R and dist(p, O) > r 
15: return ACCEPT 



The function /(n) which appears in the procedure is defined as 



With these definitions, we obtain the following crucial lemma. 

Lemma 1. Let I be a planar object with center cj. Let S be the set of results of 
n > no probes directed at cj in directions 0, 27r/n, 47r/n, . . . , 27r(n — l)/n. Then 
for any point p G bd(/), there exists a point p' G S such that dist(p,p') < fin). 

Proof. Assume wlog that cj = O, x(p) = 0, and 1 — 5 < y(p) <1 + 5. We will 
upper-bound |x(p)— x(p')| and |y(p) — yip')\- Refer to Fig. 2 for an illustration. 




and the constant uq is defined as 



no = [tt/ arctan(o;/(l + 5))] < 70 



(6) 
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Fig. 2. Constraints on the position of p'. The point p' must be in the shaded 
region, and dist(p,p') is maximized when p' is placed as shown. 



First note that there exists a sample point p' £ S such that 0 < Lpcjp' < 7r/n. 
By Assumption 1, dist(0,p') < 1 + (5, so an upper bound on |x(p)— x(p')| is 

|x(p)-x(p')| = |x(p')l < (1 + (5)sin(7r/n) (7) 

< (1 + (5)7r/n (8) 



Since /.pcip' < 7r/n, p' must lie in the cone defined by the inequality 



yip') > |x(p')l 



cos(7r/n) 

sin(7r/n) 



(9) 



Next we note that the slope of the line through p' and p must be in the range 
[— y(p)/a,y(p)/a], otherwise Assumption 2 is violated. If n > no, then the region 
in which p' can be placed is bounded, and |y(p) — yip')\ is maximized when p' 
lies on one of the bounding lines 



fi{x) = xy{p)/a + y{p) (10) 

frix) = -xyip)/a + y(p) (11) 

Since both lines are symmetric about a; = 0, assume that x(p') lies on //, giving 

\y{p) - y{p)\ < \y{p) - M^ip))\ (12) 

= |x(p')y(p)/a| (13) 

< |x(p')(l + 5)/a| (14) 

< (1 + (5)^7r/an (15) 



Substituting (8) and (15) into the Euclidean distance formula and simplifying 
yields the stated inequality. □ 



Theorem 1. There exists a roundness classification procedure that can correctly 
classify any planar object I with center cj = O and satisfying Assumptions 1 
and 2 using 0(|l/qual(/)|) probes and 0(|l/qual(/)|) computation time. 
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Proof. We begin by showing that the procedure is correct. We need to show that 
the procedure never rejects a good object and never accepts a bad object. The 
former follows from the fact that the procedure only ever rejects an object when 
it finds a point on the object’s boundary which is not contained in the annulus 
of inner radius 1 — e and outer radius 1 + e centered at c/. 

Next we prove that the procedure never accepts an bad object. Lemma 1 
shows that there is no point in bd(/) which is of distance greater than /(n) from 
all points in S. The procedure only accepts / when all points in S are of distance 
at least A = /(n) from the boundary of the annulus of inner radius 1 — e and 
outer radius 1 + e centered at O. Therefore, if the procedure accepts I, all points 
in bd(/) are contained in an annulus of inner radius 1 — e and outer radius 1 + e, 
i.e., the object is good. 

Next we prove that the running time is 0(|l/qual(/)|). First we observe that 
/(n) S 0{l/n). Next, note that the computation time and number of probes 
used during each iteration is linear with respect to the value of n, and the value 
of n doubles after each iteration. Thus, asymptotically, the computation time 
and number of probes used are dominated by the value of n during the last 
iteration. There are two cases to consider. 

Case 1: Procedure 1 accepts I. In this case, the procedure will certainly termi- 
nate once A < qual(I). This takes 0(log(l/qual(/))) iterations. During the final 
iteration, n € 0(l/qual(/)). 

Case 2: Procedure 1 rejects I. In this case, there is a point on bd(/) at distance 
qual(/) outside the circle with radius 1 + e centered at O, or there is a point in 
bd(/) at distance qual(/) inside of the circle with radius 1 — e centered at O. 
In either case. Lemma 1 ensures that the procedure will find a bad point within 
0(log |l/qual(/)|) iterations. During the final iteration, n S 0(|l/qual(/)|). □ 

3.2 The Full Procedure 

The difficulty in implementing Procedure 1 is that we may not know the position 
of the exact center, c/, of I . However, the following result from [4] allows us to 
use this procedure anyhow. 

Theorem 2 (Near-Center). Let I be a planar object with center cj and which 
satisfies Assumption 1. Then 6 probes and constant computation time suffice to 
determine a point cq such that dist(c/,co) < 25. 

We call any such point cg a near-center of I. As the following lemmata show, 
knowing a near center is almost as useful as knowing the true center. Before we 
state the lemma, we need the following definitions. 




(16) 

(17) 



ng = [tt/ arctan(a/(l + 3(5))] 
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Lemma 2. Let I he a planar object with center c/ and near-center cq. Let S be 
the set of results of n> Uq probes directed at cq in directions 0, 27r/n,47r/n, . . . , 
27r(n —l)/n. Then for any point p € bd(/), there exists a point p' £ S such that 
dist(p,p') < /'(n). 

Proof. The proof is almost a verbatim translation of the proof of Lemma 1, 
except that we assume that cq = O. With this assumption we derive the bounds 

|x(p) - x(p')l < (1 + 36){ir/n) (18) 

|y(p) - y(p')l < (1 + 3(5)^7r/n(a - 2,5) | (19) 

Substituting these values into the formula for the Euclidean distance and sim- 
plifying yields the desired result. □ 

Lemma 3. Let L be a planar object with center c/ and near-center cq. Let S 
he the set of results of n probes directed at cq in directions jn, An jn, , 

2n{n— l)/n, and let cs be the center of S. Then 

R{cs, S) < R{cs, I) < R{cs, S) + f{n) (20) 

r{cs,S) - f{n) <r{cs,I) <r{cs,S) . (21) 

Proof. We prove only the bounds on the R{cs,I) as the proof of the bounds 
on r(cs,I) are symmetric. The lower bound on i?(cs,/) is immediate, since 
S C bd(/). To see the upper bound, choose any point p € bd(/) such that 
dist(cs,p) = R{cs,I). By Lemma 1 there exists p' € S such that dist(p',p) < 
f'{n). Therefore dist(cs,p) < dist(cs,p') -I- f'{n), which implies that R{cs,I) < 
R{cs,S) -\- f{n). □ 

Lemma 4. Let L be a planar object with center c/ and near-center cq. Let S 
he the set of results of n probes directed at cq in directions 0, 27r/n, 47r/n, . . . , 
27r(n — l)/n. Then quads') — f'{n) < qual(J) < qual(S') 

Proof. The proof can be found in [4]. 

Theorem 3. There exists a roundness classification procedure that can correctly 
classify any planar object L satisfying Assumptions 1 and 2 using 0(l/|qual(/)|) 
probes and 0(|l/qual(/)| log |l/qual(/)|) computation time. 

Proof. We make the following modifications to Procedure 1. In Line 3, we set 
the value of n to tiq. In Lines 4 and 13, we replace f(n) with f'{n). In Line 6 we 
directed our probes at cq rather than O. In Lines 7 and 14, we replace the simple 
test with a call to one of the O(nlogn) time referenced roundness algorithms 
in [2] or [3] , to test whether the sample set S can be covered by an annulus with 
the specified inner and outer radius. 

Lemma 4 ensures that the procedure never accepts a bad object and never 
rejects a good object, i.e., the procedure is correct. The procedure terminates 
once f'{n) < |qual(/)|. This happens after 0(log |l/gita/(/)|) iterations, at which 
point n e 0(|l/qual(/)|. □ 
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4 Testing Cylinders 

Before we can describe a quality testing procedure for cylinders, we must gen- 
eralize the notion of quality to cylinders. In this work, we are concerned with 
the roundess of cylinders, and not their height, or the flatness of their ends. For 
these reasons, we will assume that our manufactured cylinders have length I, 
and that their ends are perfectly flat. The object, J, that we are interested in 
testing is a compact simply connected subset of the space {x, y, [0, Z]). 

We assume that J is resting on the {x,y) plane and that we know the ori- 
entation of the cylinder. Define Jh to be the set of all points (a;, y) such that 
(x, y, h) S J. Note that Jh is a planar object. We define the outer boundary of J 
as Jout = Uo</i<i and we define the inner boundary of J as Jin = rio</i<i 
For a point p on the plane, we use R{p, J) and r(p, J) to denote the maximal 
and minimal distance, respectively, from p to a point in bd( Jout) and bd(Jin), 
respectively. For a point p, let 



We call any point cj with qual(cj, J) = qual(J) a center of J. Note that accord- 
ing to these definitions J is an good object if there exists an annulus of inner 
radius 1 — e and outer radius 1 -I- e which covers both bd( Jin) and bd(Jout)- 
We require the following minimum quality assumptions. 



Assumption 3. R{cj, J) < 1 -I- <5 and r{cj, J) > 1 — <5, for some constant 
0 < (5 < 1 / 21 . 



Assumption 4. For all 0 < /i < Z, Jh is a star-shaped object, and its kernel 
contains all points p such that dist(cj,p) < a, for some constant 1 — S > a > 26. 

Assumption 5. Let J be an object with center cj, and let J[h — a, h+a] be the 
subset of points in J with z coordinate in [h — a,h + a\. Let K be the intersection 
of the infinite length cylinder of radius a, which is perpendicular to the (a;,p) 
plane and is centered at O with J[h — a, h + a]. Then J[h — a, h + a] is a star 
shaped object with kernel K. 

Note that Assumptions 3 and 4 allow us to And a near-center cq of J in 
constant time. Our procedure for testing cylinders is exactly the same as the 
procedure for testing disks described in Sect. 3.2, except that during each iter- 
ation, we perform lnj2'K sets of probes along the planes z = 0, z = 2 tt jn,z = 
Att jn^ . . . , z = Z, where each set contains n probes directed at cq. Note that 
the number of probes performed is 0{ln^). After collecting these sample points, 
they are projected onto the (a;, p)-plane, and the algorithm of [2] or [3], deter- 
mine whether there exists an annulus of inner radius r and outer radius R which 
contains them. 

As in Sect. 3, let us define the function 



qual(p, J) = min{r(p, J) - (1 - e), (1 -k e) - i?(p, J)} 
qual( J) = max{qual(p, J) : p G . 



( 22 ) 

(23) 



f'{n) = f{n) + - (((1 -k i6)-K/aY + " 

n ^ ^ 



( 24 ) 
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Lemma 5. Let J he an objeet of length I with center cj, and satisfying Assump- 
tions 3, and 5. Let cq he any point such that dist(c/,co) < 26, and let S be 
a set of ln/2-K, n > Uq, probes directed at cq as described above. Then for any 
point p G bd( J) there exists a point p' G S such that dist(p,p') < f"{n). 

Proof (Sketch). Note that S contains a ring of probes S' such that for all p' G S, 

\z{p') — z{p)\ < Tr/n. With reference to Fig. 3, one can prove that dist(p,p") < 

f'{n) using the same arguments used in the proof of Lemma 2. Using Assump- 

1 

tion 5, once can also prove that dist(p",p') < ^ (((1 + 3(5)7r/a)^ + tt^) ^ . The 
stated bound then follows from the triangle inequality. □ 




Fig. 3. The proof of Lemma 5. 



With this result, and following the arguments of Sect. 3.2, it is not difficult 
to prove the correctness and running time of the testing procedure for cylinders. 

Theorem 4. There exists a roundness classification procedure for cylinders that 
can correctly classify any object J of length I and satisfying Assumptions 3, j, 
and 5 using 0(Z/qual(J)^) probes and 0(Z/qual(J)^ log(^/qual( J)^)) computa- 
tion time. 

5 Lower Bounds 

In this section, we give lower bounds which show that the number of probes used 
in our testing procedures is optimal up to a constant factor. These lower bounds 
hold even if the center of the object is known in advance. We begin with a lower 
bound for planar objects. 

Theorem 5. Any roundness classification procedure that is always correct re- 
quires, in the worst case, I7(|l/qual(/)|) probes to classify a planar object L with 
center cj = O and satisfying Assumption 1 and Assumption 2. 

Proof (Sketch). Let / and I' be the two objects depicted in Fig. 4. For any 
0 < "0 < e, qual(/) = — qual(/') = 0. It is not difficult to see that if a procedure 
uses o(|l/0|) probes, then by rotating /' appropriately, it is possible to “hide” 
the recess in I' so that the procedure cannot differentiate between / and □ 
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2-2e+2\|/ 




Fig. 4. The proof of Theorem 5. 



A similar argument can be used for cylinders. The difference being that the 
recess in the object J' is a circular cone. That the recess in J' can be hidden 
comes from the fact that a set of v? points in the unit square always contains 
an empty circle of radius Q{l/n)} 

Theorem 6. Any roundness classification procedure for cylinders that is always 
correct requires, in the worst case, l7(^/qual(J)^) probes to classify an object J 
with center ci = O and satisfying Assumptions 3, 4, and 5. 

Acknowledgement 

The authors would like to thank Silvia Gotz for carefully reading several early 
revisions of this paper. 

References 

1. Richard Cole and Chee K. Yap. Shape from probing. Journal of Algorithms, 8:19-38, 
1987. 129 

2. Mark de Berg, Prosenjit Bose, David Bremner, Suneeta Ramaswami, and Gordon 
Wilfong. Computing constrained minimum-width annuli of point sets. In Proceed- 
ings of the 5th Workshop on Data Structures and Algorithms, pages 25-36, 1997. 
135, 136 

3. Christian A. Duncan, Michael T. Goodrich, and Edgar A. Ramos. Efficient approx- 
imation and optimization algorithms for computational metrology. In 8th ACM- 
SIAM Symposium on Data Structures and Algorithms (SODA ), pages 121-130, 1997. 
135, 136 

4. Kurt Mehlhorn, Thomas C. Shermer, and Chee K. Yap. A complete roundness 
classification procedure. In ACM Symposium on Computational Geometry, pages 
129-138, 1997. 129, 130, 131, 134, 135 

5. Chee K. Yap. Exact computational geometry and tolerancing metrology. In David 
Avis and Jit Bose, editors. Snapshots of Computational and Discrete Geometry, Vol. 
3. 1994. 130 

^ To see this, divide the square into a (n J- 1) x (n + 1) grid. Some cells in this grid 
must have no points in them, therefore an empty circle of diameter l/(n J- 1) can be 
placed in one of these empty cells. 



Casting with Skewed Ejection Direction* 



Hee-Kap Ahn, Siu-Wing Cheng, and Otfried Cheong 

Department of Computer Science, HKUST, Hong Kong 
{pastel , scheng, otf ried}@cs . ust .hk 



Abstract. Casting is a manufacturing process in which liquid is poured 
into a cast (mould) that has a cavity with the shape of the object to be 
manufactured. The liquid then hardens, after which the cast is removed. 
We address geometric problems concerning the removal of the cast. A 
cast consists of two parts, one of which retracts in a given direction car- 
rying the object with it. Afterwards, the object will be ejected from the 
retracted cast part. In this paper, we give necessary and sufficient condi- 
tions to test the feasibility of the cast part retraction and object ejection, 
where retraction and ejection directions need not be the same. For poly- 
hedral objects, we show that the test can be performed in 0(n® log n) 
time and the cast parts can be constructed within the same time bound. 
We also give a polynomial time algorithm for finding a feasible pair of 
retraction and ejection directions for a given polyhedral object. 



1 Introduction 

The manufacturing industry has at its disposal a wide variety of processes for 
constructing objects, including gravity casting, injection molding [7,10,15], lay- 
ered manufacturing (as, for instance, stereolithography [3]), material removal via 
conventional (or chemical or electrical) machining, deformation (forging, rolling, 
extrusion, bending), composition (as in composite materials, sintered ceram- 
ics, and the like), and spray deposition. In all of these manufacturing contexts, 
CAD/CAM systems of growing sophistication are presently being introduced. 
Geometric computing has thus become ubiquitous in the manufacturing indus- 
try, and more and more real-world objects begin their life as geometric objects 
modeled within a computer. The survey by Bose and Toussaint [4,6] gives an 
overview of these geometric problems and algorithms arising in this area. 

The casting process [9,16] consists of two stages. First, liquid is poured into a 
cavity formed by two cast parts. After the liquid hardens, the movable cast part 
first retracts from the fixed cast part carrying the object with it. Afterwards, 
the object is ejected from the retracted cast part. In most existing machinery, 
the retraction and ejection directions are identical, and previous work on this 
problem has assumed this restriction on casting. Existing technology for injection 
molding, however, already has the flexibility to accomodate an ejection direction 
that is different from the retraction direction of the moving cast part. Exploiting 
this possibility allows to cast more parts, or to cast parts with simpler moulds, 
and is the subject of the present paper. 
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To simplify our discussion, we will pretend that it is not the part that is 
ejected from the moving cast part, but that the cast part is removed from the 
part. In this way, we have symmetry between the two cast parts, and both 
retraction and ejection are modelled conceptually by removal of a cast part. To 
simulate the retraction, the fixed cast part will first be removed in a direction 
opposite to the retraction. Then to simulate the ejection, the remaining cast part 
will be removed in a direction opposite to the ejection. To summarize, in our 
model of casting, the two cast parts are to be removed in two given directions 
and these directions need not be opposite. Note that the ordering of removal is 
important. 

The cast parts should be removed from the object without destroying either 
cast parts or the object. This ensures that the given object can be mass produced 
by re-using the same cast parts. The casting process may fail in the removal of 
the cast parts: if the cast is not designed properly, then one or more of the cast 
parts may be stuck during the removal phase. The problem we address here 
concerns this aspect: Given a 3-dimensional object, is there a cast for it whose 
two parts can be removed after the liquid has solidified? An object for which 
this is the case is called castable. 

The 2-dimensional version of castability problem has been studied by Rappa- 
port and Rosenbloom [15]. They presented an 0(n) time algorithm to determine 
whether a simple n- vertex polygon can be decomposed into two monotone chains, 
which is a sufficient and necessary condition for the polygon to be castable. Hui 
and Tan [12] gave a heuristic approach to the 3-dimensional problem, assuming 
opposite directions for cast removal. It is based on testing candidate directions 
using sample points, and may not find a feasible direction, or may incorrectly 
return an infeasible direction. Kwong [13] gave the first complete algorithm to 
determine the feasibility of a given parting direction. He reduced the problem 
to the hidden surface removal problem in computer graphics by observing that 
if all the facets can be completely illuminated from the parting direction and 
its opposite, then the parting direction is feasible. Chen et al. [8] show how 
to find the parting direction that maximizes the number of completely visible 
“cavities” in the object. This direction, however, may not be a good parting 
direction even if one exists. Based on Chen et al.’s work, Hui [11] gave exponen- 
tial time algorithms that also take cores and inserts into account. Again they 
are not guaranteed to find a feasible cast. Bose et al. [5] considered a special 
model of casting, the sand casting model, where the partition of the cast into 
two parts must be done by a plane. Note that even convex polyhedra are not 
always castable in this model [5]. 

Finally, Ahn et al. [2] gave, to our knowledge, the first complete algorithm 
to determine the castability of polyhedral parts for opposite directional cast re- 
moval. Given a simple polyhedron with n vertices, they presented an 0(n log n)- 
time algorithm to compute castability in a given direction. They also presented 
an 0(n^)-time algorithm to compute all combinatorially distinct directions for 
which there is a good cast. They showed that there exist polyhedra for which 
there are 17 (n^) combinatorially distinct directions in which there is a good cast. 
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In this paper we give a complete characterization of castability, under the 
assumption that the cast has to consist of two parts that are to be removed 
in two not necessarily opposite directions. We also give an algorithm to verify 
this condition for polyhedral objects. We do not assume any special separability 
of the two cast parts, and allow parts of arbitrary genus. The running time of 
our algorithm for determining the castability of an object with a given pair of 
directions is O(n^logn). 

All the results for opposite cast parts removal in [2,12,13] rely on the property 
that an object is castable if its boundary surface is completely visible from the 
two opposite removal directions. This is not true when the removal directions 
are non-opposite: there are polyhedra whose whole boundary is visible from the 
removal directions but which are not castable with respect to those directions [2] . 

For completeness, we also give an 0{n^^ logn)-time algorithm for finding all 
combinatorially distinct feasible pairs of removal directions. Though the running 
time is polynomial, the algorithm is clearly of theoretical interest only. 



2 Preliminaries 

Throughout this paper, V denotes a polyhedron, that is, a (not necessarily con- 
vex) solid bounded by a piecewise linear surface. The union of vertices, edges, 
and facets on this surface forms the boundary of V, which we denote by bd(T^). 
We require bd(T^) to be a connected 2-manifold. Each facet of 7^ is a con- 
nected planar polygon, which is allowed to have polygonal holes. Two facets 
of V are called adjacent if they share an edge. We assume that adjacent facets 
are not coplanar — they should be merged into one — but we do allow coplanar 
non-adjacent facets. We also assume that V is simple, which means that no two 
non-adjacent facets share a point. Our assumptions imply that V may contain 
tunnels, but no voids — a polyhedron with a void is not castable anyway. 

Our characterization of castability applies to general objects Q. This is im- 
portant since many industrial parts are not polyhedral. However, our algorithms 
are only designed for polyhedra. 

We call the two removal directions the red and the blue direction, and we 
denote them by dr and di, respectively. We assume that the outer shape of 
the cast equals a box denoted by B, which is large enough so that the object 
is contained strictly in its interior. This assumption is necessary for producing 
connected cast parts. Our goal is to decompose the cast into two parts which 
only overlap along their boundaries. The cast part to be removed first is called 
the red part and is denoted by Cr- The other cast part is called the blue part and 
is denoted by Cb- The removal directions for Cr and Cb are dr and db respectively. 
Each of Cr and Cb is a connected subset of B. The union of Q and CrUCb equals B. 
Note that S \ (C^ U Cb) is an open set and c\{B \ (Cr U Cb)) is the object to be 
manufactured. 
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3 A Characterization of Castability 

We call an object Q castable with respect to {dr,dh) if we can translate C 
to infinity in direction d,. without collision with int(Q) and int(Cb), and then 
translate C& to infinity in direction d^ without collision with int(Q). The order 
of removal is important. 

We illuminate Q with two sources of parallel light. The red light source is 
at infinity in direction d,. and the blue light source is at infinity in direction d^. 
We say that a point p in space is illuminated by red light if a red ray from 
direction d,. can reach p without intersecting int(Q). The definition for a point p 
being illuminated by blue light is similar. Note that we assume that a light ray 
will not stop when it grazes the boundary of Q. 

There is a (possibly disconnected) subset of \ Q not illuminated by red 
light. We call it the red shadow volume and denote it by Vr- Similarly, there 
is a subset of S \ Q not illuminated by blue light. We call it the blue shadow 
volume and denote it by Vb- If we sweep Vb to infinity in direction dr, then we 
will encounter a set of points in B and we denote this set of points by . Note 
that includes Vb itself. 

Lemma 1. If Q is castable, then Vr ^ Cb and V^ f=Cr- 

Proof. By definition, ;B \ Q is contained in Cr U Cb and so both Vr and Vb are 
contained in Cr Li Cb- Take any point p in V^. If we move p in direction dr to 
infinity, then p will be stopped by int(Q) as p does not receive any red light. So p 
cannot be a point in the red cast part Cr- Thus, Vr C Cb- By similar analysis, 
Vb C Cr- Since Q is castable, Cr can be translated first to infinity in dr with- 
out colliding with int(Q) and int(Ch). Since Vb C Cr, we conclude that Vf C Cr- □ 

We are now ready to prove the necessary and sufficient condition for an object 
to be castable. 

Theorem 1. Given an object Q and removal directions (dr,db), Q is castable 
if and only ifVr lies in one connected component of cl{B \ (V^ U Q)). 

Proof. First, we prove that the condition is necessary. Since Q is castable, 

C Cr and Vr C Cb by Lemma 1. Since Cb Q cl{B \ {Cr U Q)), we have 
Vr G Cb G cl{B \ (Cr U Q)) C cl(;B \ (V^ U Q)). Therefore, if Vr does not lie 
in one connected component of cl(B \ (V^ U Q)), then so does Cb. This implies 
that Cb is not connected, a contradiction. 

Second, we prove the sufficiency of the condition. Without loss of generality, 
let dr be the positive vertical direction. Let / be the top facet of B. Let M he a 
thin layer of B below / and above Q. M exists as Q lies strictly in the interior 
of B. Take the connected component R of B\ (V^ U Q) that contains Vr. Define 
the blue cast part Cb to be cl(i? \ M). Then define the red cast part Cr to be 
c\{B\{Q\JCb)). 

We first argue that each cast part is connected. We can ensure that Cb is 
connected by adjusting the thickness of M. Since both Cr and M are completely 
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illuminated by red light, every connected component of Cr overlaps with M and 
hence Cr is connected. 

We now show that the cast parts can be removed in order. Since Cr is com- 
pletely illuminated by red light, Cr can be translated to infinity in dr without 
colliding with int(Q). We also claim that this translation of Cr cannot be ob- 
structed by int(Cb). Otherwise, a point p in Cr can see a point q in int(Cb) in 
direction d,.. If the line segment pq contains a point in V^, then q also belongs 
to which is a subset of Cr by Lemma 1. Thus, q G Cr, a contradiction. If the 
line segment pq does not contain any point in V^, then pq lies B\{V^ U Q). Since 
q G int(Cb), p also belongs to the connected component of S \ (V^ U Q) contain- 
ing Vr- Thus, we also include p in Ct- Since q € int(Ch), we can use the same 
argument for points in the neighborhood of q to show that every point in the 
neighborhood of p belongs to Cb- This implies that p G int (Cb) which contradicts 
that p G Cr- 

After removing Cr, Cb can be removed to infinity in db without colliding with 
int(Q) because Cb does not contain any point in Vb by definition. 0 

The condition in Theorem 1 also implies that Vr H Vb is empty. However, 
unlike the case where the two removal directions are opposites of each other, the 
emptiness of Vr H Vb does not guarantee castability. Figure 1 shows an object 
that is not castable, even though Vr H Vb is empty. 




Fig. 1. An object Q and its shadow volume. Vr intersects two connected com- 
ponents of S \ int(Vj U Q). 



4 Feasibility Test for a Polyhedron 

In this section, we present an 0(n^ log n)-time algorithm for testing the feasibility 
of a pair of removal directions (dr,db) for a given polyhedron V. Throughout 
this section, we treat dr as the positive vertical direction and we assume that V 
lies above the xp-plane. We use £{p) to denote the vertical line through a point p. 
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The red shadow, denoted by Sr, is the complement of the set of points on 
bd(T^) that can be illuminated by red light from dr without being obscured by 
int(T^). The blue shadow, denoted by Sb, is the complement of the set of points 
on bd(T^) that can be illuminated by blue light from db without being obscured 
by int(T^). Their intersection Sr H Sb is called the black shadow. 

For each polyhedron edge e, let hb{e) denote the plane through e and parallel 
to db- Then e is a blue silhouette edge if it satisfies two requirements. The first 
requirement is that the two facets incident to e lie in a closed halfspace bounded 
by hb{e) and the dihedral angle through int(T^) is less than tt. The second re- 
quirement is that if a facet incident to e is parallel to db, then e should be behind 
that facet when viewing from direction db- A lower blue silhouette edge is a blue 
silhouette edge e where V lies above hb{e) locally at e. Similarly, an upper blue 
silhouette edge is a blue silhouette edge e where V lies below hb{e) locally at e. 

For each lower blue silhouette edge e, imagine that e is a neon tube shooting 
blue rays in direction —db- We trace the “sheet” of blue rays emanating from e 
until they hit int(T^), or hit an edge or facet parallel to db and above int(T^) 
locally, or reach infinity in direction —db- The union of these intercepted or 
unintercepted blue rays define a subset of the plane hb{e) called a lower blue 
curtain. Note that a lower blue curtain may pass through a facet of V parallel 
to db. Such a facet must then be locally above V. 

For each upper blue silhouette edge e, we define an upper blue curtain simi- 
larly. We trace the “sheet” of blue rays emanating from e until they hit int(T^), 
or hit an edge or facet parallel to db and below int(T^) locally, or reach infinity 
in direction —db. The union of these intercepted or unintercepted blue rays form 
an upper blue curtain. Note that an upper blue curtain may pass through a facet 
of V parallel to db. Such a facet must then be locally below V. 

Given a blue silhouette edge e, we use T(e) to denote the blue curtain defined 
by e. If r{e) is nonempty, then it is bounded by a silhouette edge e, edges parallel 
to db called the side edges, and a set ^(e) of polygonal chains opposite to e. 

We divide castability testing into three steps. We first verify that the bound- 
ary of V is completely illuminated by red and blue light. That is, Sr n Sb is 
empty. Once this test is passed, we then check whether Vr O V& is empty. If this 
test is passed, then we construct the red and blue cast parts and verify that they 
are connected. 

4.1 Testing Emptiness of Black Shadow 

The emptiness of Sr n Sb can be tested in 0{n^ logn) time as follows. We put a 
horizontal plane H above V and compute the projection of V onto H with the 
hidden portion removed. The resulting arrangement is known as the visibility 
map. We project this visibility map vertically downward on the boundary of V . 
This tells us which part of bd(T^) is illuminated by red light. An edge in the 
visibility map is the projection of a polyhedron edge. A vertex in the visibility 
map is the projection of a polyhedron vertex or the intersection between the pro- 
jections of two polyhedron edges. Clearly, the size of the visibility map is 0{n^). 
It can be computed in 0{n^ logn) time using a plane sweep over the projection 
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of all polyhedron edges to remove the hidden line segments. Output-sensitive 
algorithms for visibility map computation are also known [1]. Thus, determining 
the parts of bd(T^) illuminated by red light can be done in 0{n^ log n). Similarly, 
we can determine the parts of bd(T^) illuminated by blue light in 0{n? logn). We 
can then decide whether Sr H Sb is empty by testing the intersection separately 
on every facet of bd(T^), for instance with a plane sweep algorithm. In total, this 
test takes time O(n^logn). 

4.2 Testing Emptiness of Black Shadow Volume 

Once we know that Sr H Sb is empty, we can determine if the black shadow 
volume is empty by examining the projections of blue curtains on the icy-plane. 
This is more efficient than computing Vr H Vb directly. We show how this is 
done in the following. Let tt be the projection function. We first need a technical 
lemma. Its proof can be found in the full paper. 

Lemma 2. Suppose that Vr H Vb is empty. Then for any two blue silhouette 
edges e and f, £{p) D e is not above £{p) n T{f) for all points p € 7r(e) O 
int(7r(r(/))). 

The emptiness of the black shadow is now determined by the necessary and 
sufficient condition stated in the following result. 

Lemma 3. Suppose that Sr H Sb is empty. Then Vr H Vb is non-empty if and 
only if for some lower blue silhouette edge e and blue silhouette edge f , £{p)C\e is 
not below £{p) n T{f) for some point p in 7r(e) n int(7r(T(/))) or 7r(e) n 7t(^(/)). 

Proof. We prove sufficiency first. Suppose that p is a point in the intersection of 
7r(e) and int(7r(r(/))) such that £{p) fl e is not below £{p) n T{f). By Lemma 2, 
£{p) n r(e) is also not above £{p) n T{f). Thus, £{p) n T(e) = £{p) n T{f). We 
claim that T(f) is an upper blue curtain, otherwise by definition, the interior 
of T{f) would not contain the point £{p) n T(e) as e is a lower blue silhouette 
edge. Now, we can shift £{p) as in the proof of Lemma 2 to show that an arbitrary 
short segment on £{p) below £{p) n T{f) does not receive red or blue light. Thus 
Vr n Vb is nonempty. 

The remaining alternative is that 7r(e) touches 7r(^(/)) at a point p where 
£{p) n e is not below £{p) n T{f). If we shift £{p) slightly in direction db to a 
vertical line £, we claim that £ must intersect the interior of a facet a incident 
to e. Otherwise, since e is a lower blue silhouette edge, a facet incident to e 
would face downward and direction —db and so this facet would contain a point 
in black shadow, a contradiction. Observe that £ also intersects the interior 
of r{f). Whether ^nint(cr) is above or equal to £nT(/), we can argue as before 
to conclude that Vr H Vb is nonempty. 

We now prove necessity. Any facet of cl(Vr H Vb) is parallel to db, or parallel 
to dr, or on bd(T^). At least one facet a is parallel to db, otherwise cl(Vr n Vb) 
would be unbounded. The facet a cannot receive any red light as it bounds the 
black shadow volume. Thus, a must receive some blue light, otherwise cr would 
be in black shadow which is supposed to be empty. Therefore, cr must lie on 
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some blue curtain -T(/). Let ^ be a point in int((r). If we shoot a ray upward 
from z, the ray hits bd(T^) at a point v{z). Suppose that we move z in the direc- 
tion —db. The height of v{z) from T(/) is monotonically decreasing and remains 
non-negative. Otherwise, there would be a position such that v{z) becomes a 
point in the black shadow which is impossible. Before or just when £{z) stabs an 
edge g of C{f), v{z) reaches an edge e such that V lies locally on one side of a 
vertical plane through e. Otherwise, a facet adjacent to g would contain a point 
in black shadow, a contradiction. Observe that e is also a lower blue silhouette 
edge, and £{z) O e does not lie below £{z) n T(/). Clearly, 7r(e) either intersects 
the interior of 7t(T(/)) or touches 7t(^(/)) at the intersection of £{z) and the 
ly-plane. 0 

To test the condition in Lemma 3, we identify all blue silhouette edges, 
construct the blue curtains, and check the condition in the projection of the blue 
curtains by a plane sweep. The blue silhouette edges can be identified in 0{n) 
time. To construct its blue curtain we intersect a plane with V in 0(n log n) 
time, and so we can construct all blue curtains in time 0{n? logn). Consider the 
condition in Lemma 3. For a lower blue silhouette edge e and a blue silhouette 
edge /, observe that if £{p) n e is not below £{p) C T(/) for some point p in 
7r(e) nint(7r(T(/))), then £{q)f\e is not below £{q)r\r{f) for an endpoint of 7r(e) 
clipped to int(7r(r(/))). Thus, it suffices to compute the projection of all blue 
curtains and examine the heights of blue curtains at intersection points in the 
projection. We will show below that the arrangement formed by the projection of 
all blue curtains and blue shadow facets has 0{n^) complexity. This implies that 
we can determine whether Vr H Vb is empty in time 0{n^ log n) by computing 
this arrangement, for instance using a plane sweep, and verifying the heights of 
blue curtains at all vertices of the arrangement. 

It remains to prove the bound on the complexity of the arrangement formed 
by the projections. 

Lemma 4. Let V he a polyhedron and let (dr,db) be the given pair of removal 
direetions. If Sr H Sb is empty, then 

(i) For any blue silhouette edges e and f, no edge in 7r(^(e)) crosses an edge 

in 7 t (^(/)). 

(ii) For any blue silhouette edges e and f, if an edge in 7r(^(e)) crosses 7r(/) 

at a point p, then £{p) C ^(e) is not above £{p) C /. 

Lemma 5. Let V he a eastahle polyhedron. The arrangement formed by the 
projeetions of all blue curtains and all blue shadow facets on the xy-plane has 
complexity 0{n^). This bound is the best possible. 

Proofs can be found in the full paper. 

The optimality of the bound follows from the example in Figure 2. The 
polyhedron shown in Figure 2 (a) has three horizontal upper “legs” and four 
horizontal lower “legs”, and there is a row of small “teeth”. It is castable us- 
ing the removal directions shown in Figure 2 (b). Yet every one of the upper 
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“legs” creates a lower blue curtain of linear complexity, and that has a quadratic 
number of intersections with the curtains formed by the lower “legs” . It follows 
that the arrangement formed by the projections of all the lower curtains has 
complexity see Figure 2 (c). 




(c) 



Fig. 2. (a) A polyhedron with 4 vertical pyramids and three upper legs and four 
lower legs, (b)Some lower blue curtains, and (c) The projection of blue curtains 
onto the a;j/-plane. 



4.3 Cast Part Construction 

Finally, we show how to construct the red and blue cast parts. In the process, 
we also verify whether the cast parts are connected. 

Points on blue shadow facets and points close to and above lower blue curtains 
are not illuminated by blue light. So they can only be removed in direction dr- 
Other points encountered while translating these points towards infinity in d,. 
should then belong to Cr too. This is exactly the subset of B swept by the lower 
envelope of lower blue curtains and blue shadow facets in direction dr- Denote 
this subset of Bhy X. X may be disconnected. Since V is strictly contained in B, 
we can take a layer of material M beneath the top facet of B and above V and 
use M to connect all the components in X. By Lemma 5, we can compute the 
lower envelope of lower blue curtains and blue shadow facets in 0{v? log n) time 
and so A U M can then be computed in the same time. A U M is our potential 
red cast part. All the points in c\{B \ (A U M)) are removable in direction d\,. 
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So cl(S\ {X UM)) is our potential blue cast part. However, cl(,S\ {X UM)) may 
be disconnected. Thus, we will try to attach some components in cl{B\ {XUM)) 
to X U M instead. 

Such a process is guided by the condition in Theorem 1 . Observe that cl{B \ 
{XUM)) is a subset of B\{V^UV). From the above analysis, any blue cast part 
and hence Vr lies inside cl(,S \ {X U M)). Thus, we can attach every component 
of c\{B \ {X U M)) not containing any point in Vr to X U M. These components 
are removable in direction dr as they do not contain points in Vr- In addition, if 
there are more than one remaining component of c\{B \ (^ U M)) containing Vr, 
then we can abort and report that V is not castable. Otherwise, we have the 
cast parts. 

It is unnecessary to compute Vr- Every facet bounding Vr is connected to 
some red shadow facet. The red shadow facets can be computed in O(n^logn) 
time using visibility maps. Each red shadow facet lies on a facet bounding cl(S\ 
(XUM)). We identify the set of facets bounding cl(S\ (dfU M)) that contain the 
red shadow. Then we test whether this set of facets lie in the same component 
of c\{B \ (X U M)) using a linear-time graph traversal. 

Theorem 2 . Let V be a simple polyhedron with n vertices. Given a pair of 
directions, we can determine castability and construct cast parts, if castable, 
of V in 0 {n^ logn) time and O(n^) space. 



5 Finding a Pair of Directions 

We have seen how to test whether a polyhedron V is castable in a given pair 
of directions {dr, dt). In this section we briefly sketch an algorithm to solve 
the following problem: Given a polyhedron V, decide whether there is a pair of 
directions (dr, db) hr which V is castable. In fact, we will solve the more general 
problem of finding all pairs of directions (dr, dt) for which V can be cast. 

The set of all pairs of directions forms a 4 -dimensional parameter space L'. 
We choose an appropriate parameterization that gives rise to algebraic surfaces 
in !V, see for instance Latombe’s book [ 14 ]. Our goal is to compute that part of •V 
that corresponds to pairs of directions in which V is castable. As we have proven 
before, castability depends on a number of simple combinatorial properties: the 
emptiness of the black shadow, the configuration of the curtain projections, and 
the connectedness of the blue cast part. We will compute an arrangement of 
algebraic surfaces in iL that includes all pairs of directions where one of these 
properties could possibly change. The following lemma enumerates all relevant 
situations. Its proof can be found in the full paper. 

Lemma 6. Let 71 and 72 be two pairs of directions, such that V is castable 
in 7i but not in 72. Let tt be any path in 4 -dimensional configuration space •V 
connecting 71 and 72. Then on tt there is a pair of directions (dr, dj,) such that 
one of the following conditions holds: 
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(i) A facet ofV is parallel to or di,. 

(a) The projection in direction dr of a vertex v coincides with the projection 
of an edge e. Here edges and vertices are edges and vertices of V or of the 
blue shadow Sh- 

(Hi) Two polyhedron vertices lie in a plane parallel to the plane determined 
by dr and di,. 

This characterization can be turned into an algorithm that computes the ar- 
rangement of these surfaces and tests each cell separately. 

Theorem 3. Given a polyhedral object V with nvertices and edges, we can in 
time 0(n^® logn) construct a set of all possible pairs of directions in which V is 
castable. 
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Abstract. In high-quality image printing it is sometimes required to 
repair flaws contained in a given image. A simple way for such repair is 
to paste a flaw region with white and then to move those pixels in the 
neighborhood by using a tool called an copy-brush. Since it is a very 
fine operation, it causes great effort to human operators. It is not easy 
to automate this operation in the existing matrix representation of an 
image. In our geometric representation of an image as a collection of 
contour lines for intensity levels this problem is naturally defined as one 
of reconnecting those contour lines disconnected by a flaw region. An 
efficient algorithm for reconnecting contour lines is presented based on 
perfect matching and observations on geometric properties of intercon- 
nection paths. 



1 Introduction 

Recent remarkable development in the printing machine technology is toward 
more high quality with low costs. In the sense the most important problem is 
how to automate as many of printing processes based on human operators as 
possible. In this paper we present an automatic method for one of the tasks, 
repairing a flaw region in an image, which has been considered to be very hard 
to be automated. We also demonstrate its effectiveness by experimental results. 

In commercial films even a tiny flaw must be removed. Removing electric 
poles is also the case. An usual way in these cases is to specify a region to 
be removed as a flaw region and copy pixels from surrounding region by using 
so-called a ’’copy-brush” operator. As far as we remain within a framework of 
matrix representation of images, it is not so easy to automate this operation. 
The method we propose is based on a different representation of an image, 
called ’’contour representation”, which represents an image by a collection of 
contour lines concerning their intensity levels. This is just like a terrain map 
which is obtained by regarding an intensity level at each pixel as height at the 
corresponding location. This representation admits geometric (and thus global) 
treatment of an image. This is a fundamental idea behind our approach. 
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Assuming the contour representation of an image, the above-stated flaw re- 
pairing problem is naturally defined as the following geometric problem. That 
is, specified a flaw region by a simple polygon and removed all of contour lines 
included in it, those contour lines intersecting the boundary of the flaw region 
become disconnected. If we reconnect those disconnected contour lines as nat- 
ural as possible within the flaw region, the resulting image is expected to look 
natural. In this paper, we first describe how to reconnect those disconnected con- 
tour lines based on perfect matching and observations on geometric properties 
of interconnection paths. 

The authors do not intend to apply the algorithm in this paper to restore 
random-like texture images. For those random-like images the approach by Hi- 
rani and Totsuka [4] based on combined frequency and spatial domain informa- 
tion would be more appropriate. 



2 Contour Representation of an Image 

A discrete image is usually represented in a matrix form in which each element 
represents an intensity level of the corresponding pixel (in three matrices for a 
color image). Contour representation is a different way of representation of an 
image as a collection of contour lines with intensity levels as heights. A contour 
line for a level i is the boundary of a region consisting of pixels whose levels are 
greater than or equal to i. 

Our contour representation is different from what is commonly used in com- 
puter vision in the sense that contour lines exist between pixels in our method 
while they pass through the center of pixels in the common one. More con- 
cretely, a contour line consists of horizontal and vertical lattice edges between 
pixels. This difference is important especially for the applications dealt with in 
this paper in the definition of regions. 

3 Flaw Repairing Problem 

3.1 Properties of Contour Lines 

A flaw region to be removed is specified as a simple polygon F . Then, each con- 
tour line intersecting the boundary of F is disconnected by the removal of F . 
Thus, the problem is how to reconnect those disconnected contour lines ’’natu- 
rally.” Although it is not known what is the best way to reconnect those discon- 
nected contour lines so that the resulting image looks natural. Our experience 
based on experiments suggests that minimization of the total length of chords to 
be added to reconnect them leads to reasonable results in many cases. Therefore, 
we settle our goal in this paper to propose an efficient algorithm for connecting 
contour lines intersecting a flaw region so that the total length of chords to be 
added in the region for their interconnection is as small as possible. 

Intersections of contour lines with the boundary of a flaw region F, called 
” terminals” , are numbered in a clockwise manner along the boundary 
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as no, fi, . . . , Vn-i- Since a contour line is directed so that pixels of higher levels 
lie to its right, there are two types of terminals depending on the directions of 
their associated contour lines, that is, in one type they enter the flaw region F 
and in the other type they exit from F. The type of a terminal Vi defined like 
this is denoted by type(ni). It is defined by type(ni) = IN for those terminals 
at which contour lines enter F and by type(ni) = OUT otherwise. When the 
contour line associated with a terminal Vi is the one between levels k and fc + 1, 
it is denoted by level(fi) = k (see Fig. 1). 




Fig. 1. Classification of contour lines incident to a flaw region by their directions. 



No two contour lines pass through the same place in a terrain map unless they 
are from a cliff. Thus, if we assume that no two contour lines touch each other, 
the terminals change their associated levels continuously along the boundary of 
a flaw region. Thus, a terminal of level k is adjacent to those of levels of k — l,k 
or A: + 1. Here note that contour lines are directed so that higher level pixels lie 
to their right. This means the following constraints: That is, for a terminal Vi of 
level k with the type IN (OUT, respectively), its clockwise neighbor is of level 
fc — 1 (fc + 1, respectively) if it is of type IN (OUT, respectively) and fc otherwise 
(if their types are different). 

The above assumption does not hold in practice. It is rather common in 
a real image that more than two contour lines of different levels pass through 
the same location. To treat such an image just like a terrain map the following 
convention suffices: When two pixels of levels j and k {j < k) are adjacent on 
the boundary of F, (fc — j) contour lines pass through between them and so 
we create k — j terminals associated with those levels (although their physical 
locations are the same). More precisely, if they are IN-type terminals, they are 
arranged clockwisely along the boundary of F in the decreasing order of their 
levels, and in the increasing order otherwise. No contradiction is caused by this 
convention. 

We assume the ordering of those terminals along the boundary of F. For two 
terminals Vi and Vj, a set of terminals encountered when we trace the boundary 
of F clockwisely from Vi to Vj is denoted by Vp {vi ,Vj). For any terminal Vi on 
the boundary of F there must be another terminal Vj so that Vi and Vj are 
interconnected by a contour line because any terminal is originally created by a 
contour line intersecting F. Such a terminal Vj is called a friend of Vi. Formally, 
it is defined by 
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Vj is a friend of Vi leve^Uj) = leve^Ui) and type('i;j) yf type(z;i). 

By the property that intensity levels change continuously along the boundary 
of F, for any terminal there is a bounded interval along the boundary of F in 
which its friend exists. We assume level(wi) > level(?;j) below. 

1. type(rii) = IN and type(t'j) = IN: 

a friend of Vi € Vp^vjjVi), a friend of vj G Vpivj^Vi). 

2. type(z;i) = OUT and type(uj) = OUT: 

a friend of Vi G Vp^Vi^Vj), a friend of vj G VF{vi,Vj). 

3. type(i;i) = IN and type(i;j) = OUT: 

a friend of Vi G Vp(vj,Vi), a friend of vj G Vp(vi,Vj). 

4. type(vi) = OUT and type(uj) = IN: 

a friend of Vi G Vp(vi,Vj), a friend of vj G Vpivj^Vi). 

Now, we turn to the problem of how to repair a flaw region by interconnecting 
terminals from disconnected contour lines in a ’’natural” way. Here we have to 
note the following constraints (see Fig. 2). When we say that two paths cross 
each other, it means that one lies in the both sides of the flaw region dissected 
by the other path. 



Constraint 0: Every contour line must become a closed loop without self- 
intersection (allowing touching itself). 

Constraint 1: Two terminals to be interconnected must be of the same level. 
Constraint 2: Two terminals to be interconnected must be of different types. 
Constraint 3: Terminals must be interconnected only within the flaw region. 
Constraint 4: No two contour lines cross each other. 






(a) legal connections 




(b) illegal connections 



Fig. 2. Constraints on interconnection of disconnected contour lines. 
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3.2 Algorithm for Reconnecting Disconnected Contour Lines 

The problem we consider in this paper is described as follows. 

Problem 1. When a region in an image represented by the contour representation 
is specified as a flaw region by a simple polygon, reconnect those contour lines 
disconnected by the flaw region so that all of the above-stated constraints are 
satisfied. In addition, the total length of paths to be added for the reconnection 
within the flaw region should be minimized. 

The previous solution to the problem by the authors was a greedy algorithm 
in which disconnected contour lines are interconnected one by one in the decreas- 
ing order of their associated area within the flaw region. More precisely, after 
ordering contour lines, based on the dynamic programming they found shortest 
interconnection paths under the constraints that they do not obstruct connec- 
tivity of remaining contour lines. So, it is a level-by-level optimization algorithm 
and it is questionable whether it obtains globally optimal interconnections. 

The algorithm to be proposed in this paper tries to And globally optimal 
solution to the problem. The basic idea of the algorithm is matching. 

First define a graph G = (F, E) as follows: Terminals on the boundary of a 
flaw region F are vertices of the graph and when two terminals Vi and Vj satisfy 
the constraints 1 and 2, that is, their types are different from each other but 
their levels are just the same, an edge is drawn between corresponding vertices. 
The weight of an edge is a combination of the two keys. The first key of an edge 
is the length of a shortest path interconnecting the two corresponding terminals 
within the flaw region, and is denoted by dij. Note that the distance is measured 
by the L 2 metric. The second key is the square root of their index difference 
(modulo n, the number of terminals). Although two different index differences 
can be considered for two terminals Vi and Vj, i.e., forward (or clockwise) and 
backward (or counter-clockwise) differences, we take their minimum as their 
index difference. Formally, the index difference between Vi and Vj is defined 
by 

fyj = min{(z — j n) mod n, {j — i n) mod n}. 

By using these two keys, we define a weight of an edge (vi,Vj) by {dij, 

When we compare the weights of two edges, we first compare their first keys 
and then the second keys if the first keys are the same. The reason will become 
clear in the following discussions why the first keys are not sufficient. 

The graph defined above is bipartite and it is obvious that it has perfect 
matching since all the terminals can be interconnected by their original contour 
lines. It is also evident that any matching satisfies all of the above constraints 0 
through 3. Thus, the problem of how to prevent crossing between different con- 
tour lines and to minimize the total length of contour lines at the same time is 
left. Here, we show that a perfect matching with minimum weight is a solution 
to this optimization problem. 

Theorem 1. Interconnecting terminals according to a perfect matching with 
minimum weight, no two contour lines cross each other. 
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Proof. Let Va,Vb,Vc, and Vd be terminals on the boundary of a flaw region, and 
assume that a path interconnecting Va and Vb and that between Vc and Vd cross 
each other. Then, we want to show that both of the edges {va,Vb) and (vc,Vd) 
cannot be included in a perfect matching with minimum weight. For this purpose, 
for any perfect matching M including both of the edges there exists a perfect 
matching with less weight. 

By the definition, we have 

type(i^a) yf type(ub), type(wc) yf type(wd), 
and 

level(t>a) = level(wb), level(ric) = level(z;d)- 

Before continuing the proof of the theorem, we need some basic observations. 

Observation 2 A shortest path interconnecting two terminals on the boundary 
of a flaw region F does not intersect itself. 

The observation follows from the fact that any path with self-intersection 
can be shortcut. 

Observation 3 A shortest path between two terminals on the boundary of a flaw 
region does not cross another shortest path between another pairs of terminals 
more than once. 

If they cross each other more than once, a closed region is formed between the 
first and second intersections p and q. Then, both of the clockwise and counter- 
clockwise paths between p and q along the boundary of the closed region are 
shortest paths. It contradicts to the uniqueness of a shortest path in a simple 
polygon. 

Case 1: The four terminals are all of the same level: 

By the definition, the shortest path between Va and Vb crosses the shortest 
path between Vc and Vd. Let the intersection be p. Then, the arrangement of the 
four terminals must be either {va,Vc,Vb,Vd) or {va,Vd,Vb,Vc) clockwisely along 
the boundary of F . Otherwise, the two shortest paths must cross even number 
of times, which is impossible. 

First consider the case of the arrangement {va,Vc,VbjVd). Then, the above 
shortest paths can be decomposed into four parts: the path P(z;a,p) from Va 
to p, the path P{p,Vb) from p to Vb, the path P{vc,p) from Vc to p, and the 
path P{p,Vd) from p to Vd. Changing their combinations to P{va,p) + P{p,Vd) 
and P(vc,p) + P{p,Vb), the resulting interconnection pattern has different pairs 
of terminals to be interconnected without increasing the total length of paths. 
If Va is of a different type from that of Vc, we should change the combination to 
P{va,p) + P{p,Vc) and P{p,Vd) + P{p,Vb). 

Since P(p, Va) + P(j>, Vd) is a path from Va to Vd, its length is no shorter than 
a shortest path between them, that is, it is no shorter than dad- In other words, 
we have 



dab + dcd = |P(wa,p)| + |P(p,'Ch)| -I- |P(?;c,p)| + |P(p,'Cd)| > dad + deb 
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if type(?;a) = type(uc), and 



dab “t” dec/ ^ dac “1” dfec/ 



if type(?;a) yf type(/;c). 

It is possible that equalities hold in the above inequalities. An example is 
shown in Fig. 3. 



Fig. 3. The case in which the sum of the shortest path lengths cannot be de- 
creased. 

On the other hand, taking the arrangement of terminals into accounts, we 
have the following equalities concerning the index difference. When the index 
difference between Va and Vb is given by their backward difference and that 
between Vc and Vd is given by their forward difference, we have 
^ab = iad + ^db and ^cd = ^cb + ibd- 

Thus, based on a simple observation that when 0<p<q<r<s and p + s < 
q + r, we have + yfs < ^/q + -y/r, the following inequalities follow. 

\/ ^ah "\/ ^cd ^ "\/ ^ad "\/ ^hc 7 and 

^Cab “t” "\/ Ced — ^Cad “t” ^bd “t” "\/ ^bc “t” Cbd ^ ^Cad “t” ^bd “t” ^bc “t” ^/Cbc — ^ ^ac “t” 



The proof for the case in which the index difference between Va and Vb is 
given by their backward difference and that between Vc and Vd is given by the 
forward one is similar to that for the above case. 

Difficulty arises when both of the indices are given in the same direction. For 
example, let us suppose that both of them are given by the backward indices. 
In this case, we have 

iab = {a — b + n) mod n, 

^cd = {c — d + n) mod n 
and thus we have 

Cab — ^ad T Ct/b and Cct/ — Cca “t” ^ad- 

Therefore, if the type of Va is different from that of Vc, interconnecting Va with Vc 
and Vb with Vd results in smaller sum of their index differences and the sum of 
the square roots of the index differences also decreases. 
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On the other hand, if Va and Vc are of the same type, it is not so obvious. In 
this case we are going to interconnect Va with Vd and Vb with Vc- Then, concerning 
their index differences, we have the equality 

^ab ^cd ^ ^ad ^bc- 

However, the equality implies 

^ad ^ 

By the simple observation above we have 
^^ab + y/^cd > V ^ad + 

Based on the observations above, it is seen that the combination {va,Vd) and 
{vb, Vc) has smaller weight in the case of type(ria) = type(?;c), and that of {va, Vc) 
and (vb,Vd) has smaller weight than that for (va,Vb) and {vc,Vd) if type(?;a) yf 
type(wc). 

If the terminals are arranged in the order Va,Vc,Vb, and Vd, we have similar 
discussion by reversing the directions of index differences. 

The proof proceeds similarly for the remaining cases. 

Based on the theorem, we can find an optimal interconnection pattern by the 
following algorithm. First, for each intensity level we enumerate all the crossings 
between contour lines of that level and the boundary of a flaw region, and after 
classifying those crossings into IN-type and OUT-type, for each pair of crossings 
of different types we calculate the length of a shortest path interconnecting 
them within the flaw region and their index difference. These informations are 
summarized as a bipartite graph. Since a flaw region is specified as a simple 
polygon, if we partition its inside into triangles, for each terminal we can compute 
the shortest path tree from the terminal (and the geodesic distances from the 
terminal to other terminals) in time proportional to the number of vertices of 
the polygon [3]. Thus, if we have n crossings and the flaw region is specified as 
an TO-gon, we can build such a bipartite graph in time 0{n? + mn). Then, we find 
a minimum-weight matching for this bipartite graph. Some known algorithms 
are available for this purpose [2]. The best known algorithm runs in 
time, which leads to the following theorem: 

Theorem 4. Let n he the number of eontour lines whieh eross a flaw region. 
Then, we ean find an optimal intereonneetion pattern for eontour lines diseon- 
nected by a flaw region in time + nm), where m is the number of vertices 

of a polygon whieh forms a flaw region. 

4 Experimental Results 

We have implemented our algorithm against several image data. Examples we 
have tested are to remove electric poles from a street, to remove claw’s feet 
around eyes, and removing characters in a wine bottle without eliminating high- 
light part, and so on. The results were satisfactory in all those cases. One of the 
examples is shown in Fig. 4 in which the flaw region depicted by a white region is 
restored in the resulting image below. A similar result could be achieved by man- 
ual operations using so-called ” copy-brush” which copy pixels in the surrounding 
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region. A manual operation, however, takes time and cost, and the quality of 
the resulting picture depends heavily on experience of human operators. This is 
why automatic processing is required. 

5 Conclusions 

In this paper we have presented an algorithm for removing flaws in a picture and 
described experimental results. The key idea to the success is the use of perfect 
matching for global optimum. One important contribution of this paper is that 
the new algorithm can guarantee global optimal solutions in less time than the 
previous one. That is, our previous algorithm found an optimal solution for each 
level, but it does not guarantee an optimal solution for all levels. 

The contour representation of an image as a collection of contour lines is a 
new representation proposed by the authors. It allows geometric treatment of an 
image, which is especially important for getting global view. This is another key 
to the success. One drawback of the representation is that it takes much storage. 
In fact, the total length of the contour lines may be much larger than the total 
number of pixels (or image size). In our application, however, we do not need to 
convert the whole image into its corresponding contour line representation. We 
are just required to have contour representation for the region specified as a flaw 
region. Thus, the overhead due to the representation is not so much. Conversion 
from contour representation to the conventional matrix form is straightforward. 
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Fig. 4. An experimental result. A picture with a flaw region above and the 
restored picture image below. 
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Abstract. Given an undirected multigraph G = {V, E) and two positive 
integers I and k, the edge-and- vertex connectivity augmentation problem 
asks to augment G by the smallest number of new edges so that the 
resulting multigraph becomes ^-edge-connected and fc- vertex-connected. 
In this paper, we show that the problem with a hxed £ and k = 3 can be 
solved in polynomial time for an arbitrary multigraph G. 



1 Introduction 

The problem of augmenting a graph by adding the smallest number of new 
edges to meet edge-connectivity or vertex-connectivity requirement has been 
extensively studied as an important subject in network design, and many efficient 
algorithms have been developed so far. However, it was only very recent to 
have algorithms for augmenting both edge-connectivity and vertex-connectivity 
simultaneously (see [6,7,8] for those results). 

Let G = (y, E) stand for an undirected multigraph with a set V of vertices 
and a set E of edges. We denote the number of vertices by n, and the number 
of pairs of adjacent vertices by m. The local edge-connectivity Xc{x,y) (resp., 
the local vertex-connectivity kg(x, y)) is defined to be the maximum £ (resp., k) 
such that there are I edge disjoint (resp., k internally vertex disjoint) paths be- 
tween X and y in G (where at most one edge between x and y is allowed in 
the set of internally vertex disjoint paths). The edge- connectivity and vertex- 
connectivity of G are defined by A(G) = min{AG(a;,y) \ x,y G V,x y} and 
k(G) = min{K( 3 (x, y) | x,y € V, x yf y}. Let r be a function: (^) ^ Z'*', 
where (^) denotes the set of unordered pairs of a;,y € V and Z+ denotes 
the set of nonnegative integers. We call a multigraph G r\- edge- connected if 
Ag(2;, y) > rx{x, y) for all x,y gV. Analogously, G is called r ^-vertex- connected 
if KG{x,y) > ri^{x,y) for all x,y GV. The edge- connectivity augmentation prob- 
lem (resp., the vertex- connectivity augmentation problem) asks to augment G 
by the smallest number of new edges so that the resulting multigraph G' be- 
comes TA-edge-connected (resp., r^-vertex-connected) . 

* This research was partially supported by the Scientific Grant-in- Aid from Ministry of 
Education, Science, Sports and Culture of Japan, and the subsidy from the Inamori 
Foundation. 
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As to the edge-connectivity augmentation problem, Watanabe and Naka- 
mura [10] first proved that the problem with r\(x,y) = i for all x,y G V can 
be solved in polynomial time for any given integer i. For a general requirement 
function r\, Frank [2] showed by using Mader’s edge-splitting theorem that the 
problem can be solved in polynomial time. 

As to the vertex-connectivity augmentation problem, the problem of making 
a (fc — l)-vertex-connected multigraph fc-vertex-connected was shown to be poly- 
nomially solvable for fc = 2 [1] and for fc = 3 [11]. It was later found out that, 
for fc € {2, 3,4}, the vertex-connectivity augmentation problem can be solved in 
polynomial time in [1,4] (for fc = 2), [3,11] (for fc = 3), and [5] (for fc = 4), even 
if the input multigraph G is not necessarily (fc — l)-vertex-connected. However, 
whether there is a polynomial time algorithm for an arbitrary constant fc was 
still an open question (even if C? is (fc — l)-vertex-connected). 

Hsu and Kao [6] first treated the problem of augmenting edge-connectivity 
and vertex-connectivity simultaneously, and presented a linear time algorithm for 
augmenting G = (V,E) with two specified subsets X and Y of K by adding the 
smallest number of edges so that the resulting multigraph G' satisfies 
Xc'{x,x') > 2 for all x,x' £ X and Kc'{y,y') > 2 for all y,y' S Y. The connec- 
tivity augmentation problem in the general setting was first studied in [7,8]. For 
two given functions r\,r^ : (^) ^ Z+, we say that G is {rx^Xf^)- connected if G 
is rA-edge-connected and rK-vertex-connected. The edge-and-vertex-connectivity 
augmentation problem, denoted by EVAP(rA,rK), asks to augment G by adding 
the smallest number of new edges so that the resulting multigraph G' becomes 
(ta, Tk (- connected, where r\{x,y) > Tf^{x,y) is assumed for all x,y €V without 
loss of generality. When a requirement function satisfies r,^{x,y) = k G 
for all x,y € V, this problem is also denoted as EVAP(rA,fc). The authors 
presented algorithms EV-AUGMENT [7] and EV-AUGMENT3 [8]. The first al- 
gorithm solves EVAP(rA,2) in 0(v?m\og{r? Im)) time, and the second solves 
EVAP(£, 3) in O(n^) time, under the assumption that £ is a fixed constant and 
the input multigraph is already 2- vertex-connected. However, it is left open 
whether EVAP(^, 3) with a fixed ^ can be solved in polynomial time for an 
arbitrary input multigraph G. 

In this paper, we consider EVAP (£, 3) for an arbitrary input multigraph 
G = {V,E), which is not necessarily 2- vertex-connected. It seems difficult to 
extend directly the above EV-AUGMENT3 to this case, since many properties 
used in EV-AUGMENT3 heavily depend on the 2-vertex-connectivity of the in- 
put multigraph. Alternatively, one may first apply EV-AUGMENT to an input 
multigraph G to obtain a (£, 2)-connected multigraph G', and then apply EV- 
AUGMENT3 to G' to obtain a (£, 3)-connected multigraph G” . However, in this 
case, the resulting miiltigraph G” may not be optimally augmented from the 
original multigraph G. 

In this paper, we first derive two lower bounds [a(G)/2] and f3{G) on 
opt{G), where opt{G) is the optimal value of EVAP(£,3). We next obtain a 
(£, 2)-connected multigraph G 2 = (V,EU F) with |F| = \a{G)/2\. We show 
that such G 2 can be computed by applying EV-AUGMENT. We then apply 
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several procedures used in EV-AUGMENT3 in order to replace some edges in F 
with the same number of edges to attain the 3-vertex-connectivity while pre- 
serving its {£, 2) -connectivity. However, those procedures cannot be directly used 
unless G is 2-vertex-connected. To remedy this, we show that there exist some 
edges in F for which the procedures can be applied. As a result of applying 
these procedures, we can show that either opt{G) = max{|’o;(G)/2],/3(G)} or 
opt{G) < 2£ — 3 holds and that a set of the smallest number of new edges can 
then be constructed in 0((2^ — -I- n?m + n^logn) time. Furthermore, 

we can show that if S{G) > £ or opt{G) > 2£ — 2 holds, then it can be found 
in 0{n?m + n^logn) time, and if opt{G) < 2£ — 3 holds, then a feasible solu- 
tion F' with |E'| < min{2opt(G)— 1, 2£ — 3, opt{G) + {£ + l)/2}, can be found 
in 0{nP‘m + v? logn) time. The entire algorithm is called EV-AUGMENT3*. 

In Section 2, after introducing basic definitions and the concept of edge- 
splitting, we derive a lower bound on the optimal value of EVAP (£, 3) for an 
arbitrary multigraph G. In Sections 3, 4 and 5, we give an outline of EV- 
AUGMENT3*. 

2 Preliminaries 

2.1 Definitions 

For a multigraph G = (V,E), an edge with end vertices u and v is denoted by 
(u, v). Given two disjoint subsets of vertices X, Y <ZV, we denote by Eo{X, Y) 
the set of edges connecting a vertex in X and a vertex in Y, and denote 
cg{X,Y) = \Eg{X,Y)\. a singleton set {x} is also denoted x. In particu- 
lar, Eg{u, v) is the set of multiple edges with end vertices u and v and cg(u, v) = 
\Eg{u,v)\ denotes its multiplicity. For a subset V GV (resp., E' C E) in G, 
G\V'] (resp., G[E']) denotes the subgraph induced by V (resp., E'), and we 
denote G[V - V'] (resp., G[E - E']) simply by G - V' (resp., G - E'). For an 
edge set F, we denote by V[E] the set of end vertices of edges in E. If F satisfies 
F(1E = 0, we denote G = {V, EUF) by G-l-F. A partition Ai, • • • , At of a vertex 
set V is a family of nonempty disjoint subsets Xi of V whose union is V , and a 
subpartition of V is a partition of a subset of V . A cut is defined to be a subset A 
of V with 0 A 7 ^ V, and the size of a cut A is defined by cg{X,V — X), which 
may also be written as cg(A). In particular, cg{v) for v G V denotes the degree 
of V. Let d(G) denote the minimum degree of G. We say that a cut A intersects 
another cut Y if none of subsets X DY, X — Y and A — A is empty, and A 
crosses Y if in addition V — (A U A) 0. A family X of subsets Ai, • • • , A„ is 
called laminar if no two subsets in X intersect each other. A multigraph G is 
called Gedge-connected if A(G) > £. For a subset A of A, a vertex v G V — X 
is called a neighbor of A if it is adjacent to some vertex u G X, and the set 
of all neighbors of A is denoted by Fg{X). A maximal connected subgraph G' 
in a multigraph G is called a component of G, and the number of components 
in G is denoted by p{G). A disconnecting set of G is defined as a cut S' of A 
such that p{G — S) > p{G) holds and no S' C S has this property. Let G denote 
the simple graph obtained from G by replacing multiple edges in Eg{u,v) by 
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a single edge (u,v) for all u,v G V. A component G" of G with |1/(G')| > 3 
always has a disconnecting set unless G' is a complete graph. If G is connected 
and contains a disconnecting set, then a disconnecting set of the minimum size 
is called a minimum disconnecting set, whose size is equal to k{G). A cut T <ZV 
is called tight if /g('T) is a minimum disconnecting set in G. A tight set T is 
called minimal if no proper subset T' of T is tight (hence, the induced subgraph 
G\T] is connected). A disconnecting set S is called a disconnecting vertex (resp., 
disconnecting pair) if IIS’! = 1 (resp., |5| = 2). We say that a disconnecting set 
S C V disconnects two disjoint subsets Y and F' of — S' if no two vertices 
X G Y and y G Y' are connected in G — S. For a disconnecting set S, there is 
a unique component X of G such that X A S, and we call the components in 
G[X] — S the S-components. 



2.2 Edge-Splitting 

Given a multigraph i/ = (1/U{s}, E), a designated vertex s, vertices m, z; G F//(s) 
(possibly u = v) and a nonnegative integer S < min{c//(s, u), Ch{s, i^)}, we con- 
struct multigraph H' = {V VJ {s}, E') from H by deleting 6 edges from Eh(s, u) 
and Eh{s, v), respectively, and adding new 6 edges to Eh(u, v). We say that iJ' 
is obtained from E[ by splitting S pair of edges (s,u) and (s,r;). A sequence of 
splittings is complete if the resulting multigraph E[' has no neighbor of s. The 
following theorem is due to Lovasz [9] . 

Theorem 1. Let H = (V U {s},A) be a multigraph with a designated vertex s 
and an integer i > 2 such that ch{s) is an even integer and Xi{{x,y) > i for 
all pairs x,y GV. Then, for any neighbor u of s, there is a neighbor v {possibly 
V = u) such that Xh' {x, y) > i for all x,y G V — s in the multigraph El' resulting 
from El by splitting one pair of edges {s,u) and {s,v). □ 

By applying this repeatedly, we see that there always exists a complete 
splitting at s such that the resulting multigraph H' satisfies En'is) = 0 and 
Xh' (x, y) > ^ for all x,y G V. 



2.3 Lower Bound on the Number of New Edges 

In the subsequent discussion, we consider EVAP(£, 3) for an arbitrary multi- 
graph G, and assume £> 4 (since the problem is equivalent to the 3-vertex- 
connectivity augmentation problem if f = 3). In this section, we derive two types 
of lower bounds a(G) and /3(G) on the optimal value opt{G) to EVAP(£,3). 

Let A be a cut in G. To make G {£, 3)-connected, it is necessary to add at least 
max{£ — cg(A), 0} edges between X and V — X, or at least max{3— |T'g(A)|, 0} 
edges between X and V — X — EciX) ifP — A — Tg(A') yf 0. Given a subpartition 
A = {Al, • • • , Aq, , A,^+i, • • • , A,J of V, where V - X, - EciXi) yf 0 holds for 
i = gi -I- 1, • • • , ( 72 , we can sum up “deficiency” max{£ — cc(Xi), 0}, i = 1, ■ ■ ■ ,qi, 
and max{3 — |/G(A'i)|, 0}, i = qi + 1, ■ ■ ■ , g 2 - Adding one edge to G contributes 
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to the deficiency of at most two cuts in X. Hence, to make G (£, 3)-connected, 
we need at least \a{G)/2\ new edges, where 



a{G) = max 

all subpartitions X 



^(£-cg(X,)) + 



^ (3-|rG(X,)|) 



i=qi+l 



( 2 . 1 ) 



and the maximum is taken over all subpartitions X = {Xi,- ■ ■ ‘,Xq^} 

of V with V - Xi- FciXi) yf 0, f + 1, • • • , 92 - 

We now consider another case in which different type of new edges become 
necessary. For a pair of two vertices S = {n, v'} of G, let Ti, • • • , denote all 
the components in G — S, where r = p{G — S) (note that S may not be a 
disconnecting pair in G). To make G 3- vertex-connected, a new edge set F must 
be added to G so that all Ti form a single connected component in {G+ F) — S. 
For this, it is necessary to add 

(i) at least p{G — S') — 1 edges to connect all components in G — S. 

Moreover, if £ > cg{u) holds for a u G S, then it is necessary to add at least 
i — cg{u) edges in order to make G £-edge-connected. Since adding an edge 
between v and v' contribute to the requirement of both v and v', we require 

(ii) at least max{£ — cg{v),£ — cg(u'), 0} edges. 

In the above, no edge in (i) is incident to v or v', while all edges in (ii) are 
incident to v or v'; hence there is no edge that belongs to both (i) and (ii). 
Therefore, it is necessary to add 



(iii) at least p{G— S) — l + u\&yi{£ — cg{v),£— cg{v')^Q} edges for S = {f,?;'}. 



This means that the following number of new edges are necessary to make G 
(£, 3)-connected. 



/3(G) 



max 

all vertex pairs 
S = {v, v'} in G 



p{G — S') — 1 -I- max{£ — cc(r'), £ 



cg(^'0:0} • 



( 2 . 2 ) 



Lemma 1. (Lower Bound) 7(G) < opt{G), where "f{G) = max{ [a(G)/2] , 
/3(G)}. □ 

Based on this, we shall prove the next result in this paper. 

Theorem 2. Let G he an arbitrary multigraph with n vertices and m adjacent 
vertex pairs. 

(1) For any integer £ > 4, 7(G) < opt(G) < max{7(G),2£ — 3} holds and an 
optimal solution o/EVAP(£,3) can be found in 0((2£— 3)n^^“^-|-n^m-|-n^logn) 
time. 

(2) If S{G) > i or 7(G) > 2£ — 2, then opt{G) = 7(G) holds and an optimal 
solution F can be found in Ofn^m + logn) time. 

(3) Ifj{G) < 2£ — 3, then a feasible solution F' o/EVAP(£,3) such that |F'| < 

min{2opt(G)— 1, 2£ — 3, op£(G) -I- (£ -I- l)/2j, can be found in 0{n'^m + logn) 
time. □ 
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3 Algorithm for EVAP(£,3) 

Given a multigraph G = {V,E), let 7^3 (G) denote the set of unordered pairs 
{x,y} of vertices x,y G V with KG{x,y) > 3. Thus V^{G) = (^) if n{G) > 3. 
For a subset F C if in G, an operation of removing a subset F' from F followed 
by adding a set F" of new edges with \F''\ = |F'| to F is called a shifting 
in F, and denoted by F" jF' . In particular, a shifting F" jF' in F is called a 
switching if it does not change the degree cg(v) of any vertex v in G. Given an 
Gedge-connected multigraph G, a sequence of switchings or shiftings of edges is 
called feasible to G if the resulting multigraph G' remains Gedge-connected and 
VsiG') D Vz{G) holds. 

We now present a polynomial time algorithm EV-AUGMENT3* for solving 
EVAP(£,3). The proofs of the properties and the analysis of the time complexity 
are omitted due to space limitation. An example of computational process of 
EV-AUGMENT3* is shown in Fig. 1. 

Algorithm EV- AUGMENTS* 

Input: An undirected multigraph G = {V,F) (|U| > 4) and an integer i > 4. 
Output: A set of new edges F with |F| = opt{G) such that G + F is (£, 3)- 
connected. 

Step I (Addition of vertex s and associated edges): Add a new vertex s 
together with a set Fi of new edges between s and V such that the resulting Gi = 
(V U {s}, F U Fi) satisfies 



cgi(A') > (. for all cuts X dV, (3.1) 

|Fg(A)| + |Fgi(s) n A| > 3 for all cuts A c U with V -X- Fg{X) 0 

and |Fg(A)| + |A|>3, (3.2) 

1 ^ 0 ( 2 :) I + CGi(s,a:) > 3 for all x gV, (3.3) 

and |Fi| is minimum subject to (3.1) - (3.3). We describe in Section 4 how to 
find such Fi. 

Property 1. |Fi| = a{G). □ 

If cgi(s) is odd, then we add to Fi a new edge e = {s,w) for an arbitrary 
vertex w G V so that cgi(s) becomes even. Gall this edge e = (s,u>) extra. 

Step II (Edge-splitting): Here we show the following property which is 
stronger than Theorem 1. 

Property 2. There is a complete splitting at s in Gi such that the resulting 
multigraph G 2 = (U, F U F 2 ) is (£, 2)-connected. □ 

Now if k(G 2 ) > 3 holds, then we are done, because IF 2 I = |Fi|/2 = |’o;(G)/2] 
attains the lower bound of Lemma 1. Otherwise (k(G 2 ) = 2), we can observe 
from (3.2) and (3.3) that the following holds for the family of minimal tight sets 
in G 2 , denoted by T(G 2 ). 
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T n V[F2] 7^ 0 for all T G T(G2), 

\Eg,{x) n F2I > 2 for all T = {x} £ T{G2) with iFcix)] = 1, ( 3 . 4 ) 

\EG2lx) n F2I > 3 for all T = {a;} G E{G2) with jrG(a;)j = 0 . 

Step III (Switching edges): Now the current G2 = {V,E\J E2) is (^, 2 )- 
connected, and satisfies ( 3 . 4 ). During this step, we try to make G2 3 - vertex- 
connected by switching some edges in E2 while preserving the £-edge- 
connectivity. In [8, Property 3 . 2 ], some sufficient conditions for two edges 
61,62 G E2 that are disconnected by some disconnecting pair and satisfies 
k(G 2 — {ei, 62}) > 2, are given to admit a feasible switching such that at least 
one new pair of vertices in G2 becomes 3 - vertex-connected. However, if an in- 
put multigraph G is not 2 - vertex-connected (a situation not assumed in [8]), 
there may be an edge e G F2 such that G2 — e is not 2 - vertex-connected. 

Let E*{G2) C F2 denotes the set of edges in F2 such that the removal of any 

edge e G F*{G2) violates 2 - vertex-connectivity of G2. 

Property 3 . For each tight set T in G2, G2[T U Fg2{T)] contains at least one 
edge e G F2 — F*(G2) with T n V[e] 7^ 0 . □ 

Property Let S' be a disconnecting pair in G2. If two edges Ci, 62 G F2—F*{G2) 
satisfy Ti n V[ei] 7^ 0 , i = 1 , 2 , for two distinct S-components Ti and T2 in G2, 
then G2 — {61,62} is 2- vertex-connected. □ 

From these properties, every tight set contains an edge in F2— F*(G2) and the 
two edges 61, 62 G F2— F*(G2) satisfying one of the conditions in [8, Property 3 . 2 ] 
always satisfy k{G2 — (ci, 62}) > 2 . Therefore we can repeat executing a feasible 
switching of pairs of edges in F2 — F*{G2) until none of the conditions in [8, 
Property 3 . 2 ] holds in G2. Then it is not difficult to see that all disconnecting 
pairs in G2 contain one common vertex, say v*. 

In [8, Property 3 . 3 ], another case in which a feasible switching can be per- 
formed is given. In this paper, we show a generalization of [8, Property 3 . 3 ] as 
it is now proved including the case of k{G2 — {61, 62, 63}) < 1. 

If none of the conditions in [8, Properties 3 . 2 , 3 . 3 ] for a feasible switch- 
ing holds any longer, let G3 = {V,E LI F3) denote the resulting multigraph, 
where F3 = F2. If niGs) > 3 , we are done since \Fs\ = \a{G)/ 2 \ attains the 
lower bound 7(G). Otherwise go to Step IV. 

Step IV (Shifting edges): 

Property 5 . Let Si = (v*, Ui}, i = I, • • • , q, denote all disconnecting pairs in G3. 
Then every e G F*(G3) satisfies e = (vi,Vj) for some i 7^ j, or e = (t,Vi) with 
Fcsit) = {u*,i;i} for some i. □ 

In [8, Property 3 . 4 ], some conditions are given to admit a feasible shifting 
of an edge 61 G F3 — F*(G3) incident to the common vertex v* (and another 
edge 62 G F3 — F*{Gs) such that k(G 3 — jei, 62}) > 2 holds and 61 and 62 are 
disconnected by some disconnecting pair in G3, if necessary) that decreases the 
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degree of v* by one. From Property 5, Eo^{v*) n F3 C F3 — F*{Gs) follows. 
Furthermore, from Property 4, every two edges 61,62 € F3 — F*{G3) that are 
disconnected by some disconnecting pair satisfy k(C ?3 — {61,62}) > 2. Therefore 
we can apply [8, Property 3.4] to G3, and repeat a feasible shifting and switching 
edges in F3 — F*{G3) until IF3 n Fa3(v*)\ = 0 or ccsiv*) = I holds. 

Property 6. If F*(G3) yf 0 holds, we can execute a feasible switching or shifting 
of edges in F3 — F*{G3) by applying [8, Properties 3.2, 3,3, and 3.4]. □ 

Let G4 = (P, if U F4) denote the resulting miiltigraph G3 = (P, if U F3) ob- 
tained by repeating a feasible switching or shifting of edges in F3 until none of 
conditions in [8, Properties 3.2, 3,3, and 3.4] holds in G3. It holds F*{Gi) = 0 
from Property 6. In this case, we observe that we can apply the latter part 
of Step IV and Step V of EV-AUGMENT3 [8] to G4 (while maintaining con- 
dition F*{Gi) = 0). Then as observed in [8, Property 3.8], we see that an in- 
put multigraph G becomes (£, 3)-connected by adding min|]’a(G)/2] , /3(G)} new 
edges or by adding at most 2£ — 3 new edges. This establishes Theorem 2. □ 

4 Algorithm for Step I 

We consider how to find a set Fi such that Gi = (P U s, if U Fi) satisfies (3.1) - 
(3.3), and has a subpartition X = {Wi, • • • , ■ ■ ■ ,Xq^} of P, satisfying 

6Gi(s, A) = £ - ca(Xi) for i = 1, • • • ,gi, 

6Gi(s, V,) = 3 - ]rG(W)l and V-X,-Fg{X,) ^ 0 for / = gi+1,- • -,52, (4-1) 

tci (s) C UxGxX. 

Note that such an edge set Fi attains jFi] = a{G). 

Algorithm ADD-EDGE* 

1. After adding a sufficiently large number (say, i) of edges between s and each 
vertex v G V, discard them one by one as long as (3.1), (3.2) or (3.3) is not 
violated. Let G[ = (P U s, F U F{) be the resulting multigraph where F{ = 
Fg[{s,V). Unless this G[ has a subpartition of P satisfying (4.1), we continue 
shifting or removing edges in Eg'^{s,V), while preserving (3.1) - (3.3). 

Property 7. For each edge e = (s, t) G Eq' (s, V) such that G{ — e violates (3.2) 
or (3.3), G'l has a cut T CV with t gT and V — T — Pg{T) yf 0, satisfying 

(I) lFG(r)l = 2 and Eg>{s,T) = |(s,f)}, or 

(II) ]FG(r)l = 1 and CG[{s,t) = cg[(s,u) = 1 hold for t yf u G T if jF] > 2, 

or ]FG(t)l = 1 and CG[{s,t) = 2 if F = {t}. □ 

If F C P satisfies (I) or (II) in Property 7, F is called K-critical. Let 7} 
(resp., T2) denote the family of all K-critical cuts F of type (I) (resp., (II)) such 
that no F' C F is of type (I) (resp., (II)). For each edge e = (s,u) G F[ such 
that G'l — e violates (3.1), there is a unique cut A„ C P, called X- critical, such 
that u G Xu, cg[ (A1„) = £ and cg^ (Y) > £ holds for all cuts Y with u GY C Xu- 
Let Xi denotes the family all A-critical cuts A„. 
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Fig. 1. Computational process of algorithm EV-AUGMENT3* for £ = 8. (1) An 
input multigraph G = (V, E) with A(G) = k(G) = 1, where the number beside 
each edge is the multiplicity of the edge (the numbers for multiplicity 1 are 
omitted). The two lower bounds in Section 2 are = ^ = 9 and /3(G) = 3. 

(2) Gi = (V U {s}, GU Fi) obtained by Step I. Edges in F\ are drawn as broken 
lines. Now Gi satisfies (3.1) for £ = 8, (3.2) and (3.3), and the edge e = (s,w) 
is extra. (2-1) G'^ = (V U {s},GU F[) with F[ C Fi in Step II, satisfying (3.1) 
and (5.1). (2-2) G2 = (E U {s},E U {Fi — F{) U F 2 ) in Step II, where F 2 is a 
set of edges obtained from splitting edges in F{. Now G^)^] is (8, 2) -connected 
and every edge in Eo'^{s,V) — e is K-critical. (2-3) G '2 = (E U {s},E U F 2 ) in 
Step II obtained from splitting edges in Fi — F{ in G2, which creates no self-loop. 

(3) G2 = (E, E U F2) obtained by Step II. The G2 satisfies A(G2) > 8 but has a 

disconnecting pair S = {u*, u}. (4) G^^^ = (E, if U £^2^^) obtained from G2 by a 
feasible switching {(mi, U 2), (wi, W2 )}/{(mi, wi), (r«2, W2)} in Step III. Moreover, 
any switching is no longer feasible in (5) G4 = {V,E U F4) obtained by 

shifting {(ui, U3)}/{(?^*, U3)} in Step IV. This G4 is (8, 3)-connected. □ 



Property 8. Let := 7) — {Ti,Tj G 7) | and Tj cross each other in 
} — {Ti & Fi \ Ti C Tj for some Tj G 7^}. Then Ai U U 72 covers T'g'^(s), 
and every two cuts in U 72 are pairwise disjoint. □ 

From this property, if G\ does not have a subpartition of E satisfying (4.1), 
then there are two cuts A G Ai and T G T)' U 72 satisfying the following: 

(i) CG/ (s, T) = 1 if T G r/, and cg; (s, T) = 2 if T G T2. 

(ii) r n A ^ 0 and (T - A,) n Tg; (s) ^ 0. 



(4.2) 
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2. Let H denote an arbitrary 2-vertex-connected multigraph. Let us regard as 
the family T (H) of all minimal tight sets in H, since every two cuts in are pair- 
wise disjoint and every cut T G T{ satisfies |/G(r)| = 2 and 

V — T — Fc'^iT) yf 0. It was shown in the algorithm ADD-EDGE in [8] that 
the number of two cuts X G Xi and T G T{H) satisfying (4.2) in H can be 
decreased to 0. Hence, by applying the same algorithm, the number of pairs of 
cuts X G and T G satisfying (4.2) can be decreased to 0. Moreover, we 
can see that the resulting multigraph G'( has a subpartition of V satisfying (4.1) 
and hence let Gi := G". □ 

5 Algorithm for Step II 
Algorithm SPLIT 

1. Let F{ C be a set of edges such that G[ = (E U s,E U F{) satisfies (3.1) 
and the following (5.1), but no {V Us,ifUF') with F' C F[ satisfies them. 

|rG(X)| -h |/g',(s) n A| > 2 for all cuts A C E with V -X- Fg{X) yf 0 

and |Eg(A)| + |A| >2, (5.1) 

cg[{s,x) > 2 for all X = {a;} with Fa{X) = 0. 

If cg'j(s) is odd, then we choose one edge e* G F\ — and update G( := 
Gj -I- e* and F[ := F{ + e* . Then it is shown in [7] that by a complete splitting 
at s in G'l, which does not create a self-loop, we can obtain a multigraph G '2 = 
(E U {s}, E U (Fi — F[) U F 2 ) satisfying one of the following conditions, where F 2 
is the set of edges obtained from splitting edges in F[ . 

(i) G 2 [E] is (£, 2)-connected. 

(ii) G 2 [E] is (£,l)-connected and has exactly one disconnecting vertex v. At 
most one edge in F 2 is incident to v, and each edge e G F 2 which is not 
incident to v satisfies p{G 2 [V] — v) = p((G 2 [E] — e) — v) — 1. 

If G 2 satisfies (ii), go to step 2; otherwise go to step 3. 

2. Let Tj, i = 1, • • • , r', denote all n-components in G 2 [E]. If G 2 [E] has the edge 
e = {v,ti) G F 2 incident to v, then assume ti G Ti without loss of generality. 
Here we can prove from the above property of edges in F 2 and properties (3.2) 
and (3.3) in Gi that cg>^{s,Ti) > 1 and CG^(s,7i) > 2 hold for i = 2,---,r'. 
Let G '2 = (E U {s},E U (E — F{ — F{') U E 2 U E 2 O be the multigraph result- 
ing from a sequence of splitting pairs of edges (s,ai) and (s, fc^+i) to (ai,6i+i), 
i = 1, • • • , r' - 1, where oi G Ti n Fq'^ (s), ai,bi GTif] Fq'^ (s) for i = 2, • • • , r', 

= {{(s,a^),(s,b,+l)} I i = l,---,r'-l}, and E 2 ' = {(a^,bi+l) | i = 
1, • • • , r' — 1}. This G 2 [E] is now (£, 2)-connected. Let G 2 := G 2 , F( := F( U F", 
and F 2 := F 2 U F 2 , and go to 3. 

3. Now G 2 [E] is (£, 2)-connected and cg^(s) is even. Here it is not difficult to 

see that G '2 has a complete splitting at s which creates no self-loop (if necessary, 
the extra edge e chosen in Step I will be rechosen). Let G 2 := G 2 — s, where G 2 
denotes the multigraph resulting from such a complete splitting in G 2 . □ 
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6 Concluding Remarks 

In this paper, we combined the algorithms EV- AUGMENT [7] and 
EV-AUGMENT3 [8], and gave an algorithm for augmenting a given arbitrary 
graph G to an Gedge-connected and 3-vertex-connected graph by adding the 
smallest number of new edges. However, our lower bound on the optimal value 
does not always give the exact optimal value. So, it is desired to find a new and 
stronger lower bound on the optimal value. 
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Abstract. Let U be a finite set, and 5R be the set of reals. We consider 
the polyhedron P = {z G | ^ 

a system {V, f) with an intersecting submodular and posi-modular set 
function f : 2^^ where denotes the set of |U|-dimensional 

nonpositive vectors. We first prove that there is a laminar family A C 2'^ 
such that P is characterized by {z € | ■*(*) — /(^)> ^ '^}- 

Based on this, we can solve in polynomial time the edge-connectivity 
augmentation problem with an additional constraint that the number of 
vertices to which new edges are incident is minimized. 

1 Introduction 

Let U be a finite set, where |U| is denoted by n. A singleton set {t;} may be 
written as v. For two subsets X,Y C V, we say that X and Y intersect each 
other if XPY yf 0, X — Y yf 0 and Y — X yf 0. A family A C 2^^ is called laminar 
if no two subsets in X intersect each other. Two intersecting sets X and Y are 
called crossing if U — (A U K) yf 0 also holds. 

Let 5? (resp., and 3?_) be the set of reals (resp., nonnegative reals and 
nonpositive reals), and let (resp., and be the set of n-dimensional 
real vectors (resp., nonnegative real vectors and nonpositive real vectors) on a 
ground set V. A set function f on U is a function / : 2^ i-^ 3?. For a vector 
z G 3?^ and a subset X C V, we denote Such a function 

z : 2^ I— > 3? is called modular. A function / is called fully (resp., intersecting, 
crossing ) submodular if it satisfies the following inequality 



holds for every (resp., intersecting, crossing) pair of sets X,Y CV. A function / 
is called fully (resp., intersecting, crossing) supermodular if — / is fully (resp., 
intersecting, crossing) submodular. An / is called symmetric if f{X) = f(V — X) 
holds for all A C U. In this paper, we call a function / fully (resp., intersecting, 
crossing) posi-modular if 



* This research was partially supported by the Scientific Grant-in- Aid from Ministry of 
Education, Science, Sports and Culture of Japan, and the subsidy from the Inamori 
Foundation. 



/(A) + /(F) >/(Anr) + /(AUF) 



( 1 ) 



f{X) + f{Y)>f{X-Y) + f{Y-X) 



( 2 ) 



K.-Y. Chwa and O. H. Ibarra (Eds.): ISAAC’98, LNCS 1533, pp. 169—178, 1998. 
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holds for every (resp., intersecting, crossing) pair of sets X,Y C V [6]. An / 
is called fully (resp., intersecting, crossing) nega-modular if — / is fully (resp., 
intersecting, crossing) posi-modular. Any modular function ^ such that z(i) > 0 
for all i £ V is posi-modular. Also a symmetric fully submodular function / is 
fully posi-modular. However, the converse is not generally true. 

A pair {V, f) of a finite set V and a set function / on 1/ is called a system. 
The optimization in a system (V, f) has been much studied, such as: 

Problem 1 (primal type): minimize d^{z) 

subject to z{X) < f{X) for all X G 2^ 

0 < z{i) < d{i) for ali gV 

(an additional constraint z{V) = f{V) may also be imposed), where <l>{z) : 
EX I— *■ 3? is an objective function and d G is a given constant vector. For 
fully submodular functions /, Problem 1 appears in many applications [5]. Given 
a system {V,g), a dual type of this problem is stated as follows: 

Problem 2 (dual type): minimize >P{t) 

subject to g{X) < t(X) for all X G 2^ 

0 < t(i) < d(i) for all z G P 

(where we may also impose an additional constraint t(V) = g(V)). Problem 2 
with a certain supermodular function g appears in the edge-connectivity aug- 
mentation problem [1,3] and the problem of computing the core of a convex 
game [8]. The above Problems 1 and 2 are generalized into the following com- 
mon formulation. For two set functions g and / on V , and vectors c?i,c ?2 G 
we consider: 

Problem 3 (mixed type): minimize 

subject to g{X) < z{X) < f{X) for all X G 2^ 
di{i) < z{i) < d 2 {i) for all i gV. 

In this paper, we consider Problems 1-3 with intersecting submodular and 
posi-modular functions / and —g. 

Before going into details, let us explain an application of Problem 2 to 
the edge-connectivity augmentation problem. Let N = (V, E, c) be an undi- 
rected complete network with a vertex set V, an edge set E = V x V and 
an edge weight function c : E 3i_|_. The cut function fjsi : 2^ i-^ 3J_|_ 
is defined by /at(A') = X){c(e) | e = {m,zz} G E, u G X, v G V — X} 
(where /at(0) = fN{V) = 0). It is known (and easy to see) that the cut func- 
tion /at is symmetric and fully submodular. The edge-connectivity augmenta- 
tion problem asks to increase edge weights c to obtain a fc-edge-connected net- 
work N' (i.e., fM'{X) > k holds for all X G 2^ — {0,P}). Frank [3] intro- 
duced an additional constraint to this problem, the degree constraint: Given a 
vector d G the output fc-edge-connected network N' is required to satisfy 
^gg^(-)(c'(e) — c(e)) < d{i) for all i G V, where E{i) denotes the set of edges 
incident to a vertex i. The problem can be formulated as Problem 2 by using 
the following result. 
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Lemma 1. [1,3] Given a network N = (V,E,c), a constant k > 0, and a vector 
t G such that 

fN{X) + t{X) > k for all Xg2^ - {0, V}, (3) 

there is a k- edge- connected network N' = {V,E,c') satisfying ~ 

c(e)) = t{i) for all i GV . Also, the d can be chosen as integers if c, t and k >2 
are integers and t{V) = ^(*) even integer. □ 

Notice that the total increase ~ ^(e)) of weights is ^t{V). There- 

fore, in order to solve the edge-connectivity augmentation problem, we only 
need to find a vector t G that minimizes t(V) = f^mong all vec- 

tors t satisfying (3) (and t(i) < d{i), i GV ii the degree constraint is imposed). 
Hence, by defining <?(t) = \t{V) and a fully supermodular set function g by 
g{X) = k - fN{X) for all X G 2^, we see that the smallest amount a{N, k) 
of new weights to be added to obtain a fc-edge-connected network N' is given 
by the minimum value of <P{t) over all t G 3?+ satisfying g{X) < t{X) for all 

X g2^ — {%,V} (and t{i) < d{i), i gV \i the degree constraint is imposed). In 

the case of integer version, a{N, k) is given by [ \ . In any case, the problem 
of finding such a vector t can be formulated as Problem 2 with these <!>, g and d. 



In this paper, we first characterize the polyhedra of Problems 1-3 with in- 
tersecting submodular and posi-modular functions / and —g, and then present 
a combinatorial algorithm for solving Problem 3 with a linear function <?(t) 
(and assuming a further restriction on g). Note that Problem 3 is more general 
than Problem 2 in the sense that it allows additional constraints z{X) < f{X), 
X G 2^ . This enables us to solve in polynomial time the edge-connectivity aug- 
mentation problem with a more general degree constraint that, for each subset 
X <ZV , the total increase of degrees in X in the resulting network N' is bounded 
by a given constant f{X). 

We also show that Problem 2 can be solved for an objective function <I>{t) = 
\{i G V \ t{i) > 0}| in O(n^) function value oracle calls. Based on this, we 
can solve in polynomial time the problem of augmenting edge-connectivity of a 
network so as to minimize the number of vertices having edges whose weights 
are increased. 

The paper is organized as follows. In Section 2, we characterize the polyhe- 
dron of Problem 2, and give algorithms for solving Problems 1-3. In Section 3, 
we characterize all the extreme points of the base polyhedron of Problem 2, and 
discuss a relation to the core of a convex game. 

2 Polyhedral Structures and Problems 1-3 

A polyhedron of a system {V, /) is defined by 

P{f) = {ZG^^ \ z{X) < fix), VX € 2^ - {0, P}}, 



( 1 ) 
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where X = 0 and V are not considered in the definition, and a base polyhedron 
of {VJ)hy 

B{f) = {zGP{f)\z{V) = f{V)}, (2) 

where possibly B{f) = 0. Let P-{f) and B-{f) denote P{f) H 3?^;^ and 
B{f) n 3?^, respectively, and let P+{f) and B+{f) denote P{f) n 3?^ and 
B{f) n 3?^, respectively. For two set functions /i and /2 on V, we denote by 
(/i — / 2 ) the set function /' with f'{X) = fi{X) — f 2 {X) for all X G 2^ . 

We say that a subset X C V separates x and y if \{x,y} n X| = 1. For a 
submodular system (V,f), an ordered pair {x,y) of elements in V is called a 
pendant pair if f{x) < f{X) holds for all sets X dV that separate x and y. 

Lemma 2. [6] For an intersecting submodular and posi-modular set function f 

on V {where n = \V\ > 2), there exists a pendant pair {x,y). Furthermore, such 
a pendant pair can be obtained by using 0{n^) function value oracle calls. □ 

2.1 Polyhedral Structure of P—{f) 

In this section, we first consider the set of all feasible vectors to Problem 2, 
where we assume that g is an intersecting supermodular and nega-modular set 
function and a vector d G 3?}(^ is given by d{i) = +oo {i G V). In this case, 
/ = —g is intersecting submodular and posi-modular. Then a vector t is feasible 
to Problem 2 if and only if — t G P-{f) holds for a system (V, /). 

We now prove that, given a system (V, /) with an intersecting submodular 
and posi-modular set function /, there is a laminar family X C 2'^ — {0,P} that 
characterizes P-{f) as follows. 

P_(/) = P_(/;T’), (3) 

where we use notations P{f-,X) = {z G 3?^ | z{X) < f{X) for all X G X} 
and P_ (/; df) = P(/; df) n 3?^ . 

Given an intersecting submodular and posi-modular set function / on V, we 
compute the above laminar family X as follows. Initially we set df := 0 and 
z{i) := 0 for all i G V . Then, for each i G V, we check whether z{i) < f{i) 
(i.e., f{i) > 0) holds or not. If f{i) < 0 then we reset z{i) by z{i) := f{i) 
and add {i} to X. Now z{i) < f{i) (i.e., (/ — z){i) > 0) holds for all i G 
V . Note that f — z remains to be intersecting submodular and posi-modular. 
Hence there is a pendant pair {x, y) in system {V, f — z)hy Lemma 2, for which 
any cut X separating x and y satisfies z{X) < f{X). Then we can contract x 
and y into a single element x* without losing any cut X that satisfies z(X) > 
f(X). After this contraction, we check whether the new element x* satisfies 
(f — zXx*) >0. If (/ — z)(x*') < 0, then we add to X the set X* of all elements 
which have been contracted into x* so far, and decrease z{i) of some i G X* so 
that z{X*) = f{X*) holds (where more than one z{i) may be decreased as long 
as z{X*) = f{X*) is satisfied). (If (/ — z)(a;*) > 0, no z{i) is changed.) Then we 
repeat finding a new pendant pair and contracting them into a single element in 
the resulting system, until the system has only one element. The entire algorithm 
is described as follows. 
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Algorithm LAMINAR 

Input: A system (V, /) with an intersecting submodular and posi-modular set 
function f on V, 
where n =\V\> 2. 

Output: A vector z G P-(f), and a laminar X of V satisfying (3). 

1 A := 0; z{i) := 0 for all i G V; 

2 For each i €V, if f{i) < 0 then z(i) := f{i) and X := X U {{*}}; 

3 for z := 1 to n — 1 do 

4 Find a pendant pair {x, y) in (A', / — z); 

5 Let {V , f — z) again denote the system obtained from the current 
{V , f — z) by contracting x and y into a single element x*; 

6 if (/ — z){x*) < 0 then 

7 Let A* be the set of all elements of V which have been contracted 
to X*; 

8 Decrease z(i), i G X* arbitrarily so that the resulting f — z satisfies 
(/-z)(A*) = 0in(A,/); 

9 X:=XU{X*} 

10 end/* if*/ 

11 end. /* for */ 



Clearly, LAMINAR runs in O(n^) function value oracle calls, as the pendant 
pair in line 4 can be found in 0{nf) function value oracle calls by Lemma 2. 
Note that the vector z output by LAMINAR may not be unique because there 
are many ways of decreasing z(z), i G X* in line 8. Let OUTPUT{f) denote the 
set of all vectors z that can be output by LAMINAR for a given input /. 

For a laminar family A C 2^^ on R, a subset A G A is called a child of a 
subset A G A (and the A is called the parent of A) if A C A and there is no 
other subset A' G A with A C A' C A. For a subset A G A, let ch{X) denote 
the set of children of A, and pa{X) denote the parent of A (possibly pa{X) = 0). 

Let A be a family of subsets of V output by LAMINAR, which is clearly 
laminar. We represent A by a rooted tree as follows. Define the laminar family 
V = A U {A} U {{z} I z G A} and define the tree T = (V,S) on V, where 
the parent-child relation in the tree is given by pa{X) and ch{X). Clearly A 
is the root of T. Define /' : V 3? by f'{X) = 0 if A = {z} and /(z) > 0; 
/'(A) = /(A) otherwise. From the behavior of LAMINAR, the next properties 
are observed. 

Lemma 3. For a system (A, /) with an intersecting submodular and posi- 
modular set function f on V with \V\ >2, let z and A be the vector and the lam- 
inar family output by algorithm LAMINAR. Let the tree T be defined as above. 
Then: 

(i) z G P-if) (hence OUTPUT{f) C P_(/)). 

(ii) For each non-root vertex X in T, f'(X) < J^Y^chix) f'(X) holds. □ 

From this lemma, we can prove (3). Clearly, P-(f; A) D P_(f). The converse 
is shown as follows. Given a vector z' G P-{f), there is a way of decreasing 
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{z(i) I i € X*} in line 8 of LAMINAR so that the algorithm outputs a vector 
z(S OUTPUT{f)) such that z > z' . Hence z' € P-{f) follows (further detail is 
omitted) . 

Theorem 1. For a system (V, /) with an intersecting submodular and posi- 
modular set function f on V with \V\ > 2, let X be the laminar family output 
by algorithm LAMINAR. Then: 

(i) P_(/) = P_(/;A). 

(ii) B_{f) if and only if f'{V) < J2Yech{v) ° 

2.2 Problem 2 

Based on Theorem 1, we can solve Problem 2, if it has a linear objective function 
d>{t) = ^(*)^(*) defined by a cost vector w G 3?'^. The proof for this case 

will be given in Section 2.4 under a more general setting of Problem 3. We 
consider another special case of Problem 2 when the objective function is given 

by 

<l>{t) = \{i & V \ t{i) > 0}| (i.e., the number of nonzero entries). 

By Theorem 1, we prove the next property (the proof is omitted). 

Theorem 2. For an intersecting supermodular and nega-modular set function g 
on V, and a vector d G Problem 2 with an objective function T>{t) = \{i gV \ 
t{i) > 0}| can be solved by using 0{n^) function value oracle calls. If Problem 2 
is feasible, then there is a feasible vector t which minimizes 'T(t) and <T'(t) = 
^(*) same time. Such a solution t also can be found by using 0{n^) 

function value oracle calls. □ 

By applying this and Lemma 1 to the degree constrained edge-connectivity 
augmentation problem of a graph to minimize the number of vertices whose 
incident edges have increased weights, we obtain the next (the proof is omitted) . 

Theorem 3. For a complete network N = {V, E = V xV, c) , k > 2 and d G 
where all c(e), k and d(i) are integers, let c'(> c) be a new integer-valued edge 
weight function such that N' = (V,E,c') is k- edge- connected under the degree 
constraint that new degree of each vertex i G V is at most d{i). There is a d 
that simultaneously minimizes (i) the number of vertices to which an edge with 
increased weight is incident and (ii) the total amount of increase ~ 

c(e)). Such a d can be found in 0{{nm -\- n'^ log n) log n) time, where n = \V\ 
and m is the number of edges of positive weights in N. □ 

2.3 Problem 1 

In this subsection, we consider polyhedra P{f) and P+{f) for an intersecting 
submodular and posi-modular function / on V, which appear in Problem 1. 
However, we do not consider the constraint z < d, as this more general case will 
be considered in the next subsection as Problem 3. 
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To generalize Theorem 1 to this case, we further assume that the set func- 
tion / defined by 

f {X) = f {X) - mf{X) for X& 2^ 

is intersecting submodular and posi-modular, where m/ denotes the modular 
function on V defined from / by m/(i) = f{i) for all i G V. 

Now we discuss how to compute a vector z G P+(/). Let us consider y = 
z — mf. Then 0 < z{X) < f{X) holds if and only if 0 < y{X) + mf{X) < 
f{X) = f{X) — mf{X). Thus the problem is equivalent to finding a vector 
y G P{f) n{y G^^ \y + mf >0}. 

We first consider P{f). Note that P{f) = P-{f) since f{i) = 0 holds for all 
i G V. Therefore, P{f) = {z = y + mf \ y G P_(/)}. By applying Theorem 1 
to system {V, /), we obtain a laminar family X such that P-{f) = P-{f', X), by 
using 0{v?) function value oracle calls. Clearly, for any z G P{f;X), we have 
y = z — mf G P-{f\X) = P-if), and hence 2 € P(f) (the converse is also 
clear). Thus, P{f) = P{f; X) holds. 

Next consider a vector z G P+ (/) . From the above argument, it holds P+ (/) = 
P{f)n'SiX = {z = y + mf \ y G P-{f), y > -mj}. A vector y G P-{f) satisfying 
constraint y > —to/ (if any) can be easily computed. From these, we establish 
the next result. 

Theorem 4. For a system (V, /) with a set function f on V with n = \V\ >2 
such that f — mf is intersecting submodular and posi-modular, there is a laminar 
family X such that P{f) = P{f;X). Such a family X and a vector z G P+{f) 
{if any) can be found by O(n^) function value oracle calls. □ 



2.4 Problem 3 

In this subsection, we solve Problem 3 with a linear objective function <P{z) = 
w{i)z{i) for a given vector w G 3?'^. 

Theorem 5. Let g and f be set functions on V, and d\,d 2 G and w G 
If —g and f — mf are both intersecting submodular and posi-modular, then an 
optimal solution z to Problem 3 with objective function ^(z) = w{i)z{i) 

can be found by using 0{n^) function value oracle calls and by solving a minimum 
cost flow problem with 0{n) vertices and arcs. 

Proof Sketch: By Theorems 1 and 4, there are laminar families Xi and X 2 
such that {z G 3i)j( | g{X) < z{X) for all X G Xi} = {z G 3?)|( | g{X) < 
z{X) for all A G 2^ - {0, V}} and {z G 3?^ | z{X) < f{X) for all X G X 2 } = 
{z G 3?'^ I z{X) < f{X) for all A G 2^ — {0, P}}. Thus, the problem is restated 

minimize d>{z) = X^igy w{i)z{i) 
subject to g{X) < z(A) for all X G X[ 

z(A) < /(A) for all X G X!^ 

di{i) < z{i) < d 2 {i) for all i gV, 



as 
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where X[ = U {0,1^} for i = 1,2. Then it is not difficult to see that the 
problem can be formulated as the minimum cost flow problem in a directed 
network (further detail is omitted). □ 

3 Extreme Points of Base Polyhedron 

In this section, we characterize all extreme points of a base polyhedron B_(/) 
defined for an intersecting submodular and posi-modular set function /. We then 
show some relation of the result to a core in a convex game. 



3.1 All Extreme Points of B-{f) 

Let 7T„ be the set of all permutations of (l,2,...,n). For a subset P C 
and a permutation tt € ^\v\! ^ vector z € 3?)^ is called lexicographically tt- 
minimal {tt -minimal, for short) in P if there is no other vector z' € P which is 
lexicographically smaller than z with respect to tt; i.e., there is no j such that 
z'(7t(z)) = z(7r(i)) for i = 1, 2, ... ,j — 1 and z'(7r(j)) < z(7r(j)). Let L{f) be the 
set of TT-minimal vectors in B-{f) for all tt € Pin, and EP{f) be the set of all 
extreme points in Based on Theorem 1, we can show the next result. 

Theorem 6. Let (V, f) he a system with an intersecting submodular and posi- 
modular set function f on V with n = \V\ > 2. If B-{f) ^ 0, then L{f) = EP(f) 
holds. □ 

If B-{f) 7 ^ 0 and L{f) = EP{f), then we define the mean vector ipp of all tt- 
minimal vectors z,r, tt S II n by tf f = ^ \ ^ Bn}, where possibly z^ = z,r' 

holds for two permutations 7r,7r' € 7T„. Again by Theorem I, we can show 
that, for an intersecting submodular and posi-modular set function /, the mean 
vector fjf can be efficiently computed from the laminar family X. 

Theorem 7. For a system (V, f) with an intersecting submodular and posi- 
modular set function f on V with n = \V\ > 2, let X be the laminar family 
output by algorithm L AMIN AB. Then the mean vector ip f of all lexicographically 
minimal vectors in B-{f) can be computed from X in 0{n^) time. □ 



3.2 A Relation to a Convex Game 

A cooperative game in the game theory is defined by a pair (V, g) of a set V of 
players and a nonnegative set function g on V, where g is called the characteristic 
function of the game and satisfies g(0) = 0. Several solution concepts such as 
core, Shapley value, r-value and others have been proposed. The core of a game 
(V,g) is the set CORE{g) of nonnegative vectors z € such that z{X) > 
g{X) for all X G 2^ and ziV) = g{V). In other words, it can be defined by 
CORE{g) = {— z' I z' S B-{—g)} for a system {V,—g). Note that CORE{g) 
is always a convex set. The problem of testing whether a convex game (V, g) 
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has a nonempty CORE(g) can be solved by computing the minimum value 
^(-2) = SiGV Problem 2 with d = +oo. 

The Shapley value (f)g € 3?'^ is a solution concept proposed by Shapley [7], 
which is defined by 

= X! - g{S - f)] for each i e V. 

SCV: ies 

A game {V, g) is called convex if g is a fully supermodular set function on V . 
Several structural properties have been studied for a convex game. In this section, 
we consider a game (P, h) with an intersecting supermodular and nega- modular 
function h on V, which is slightly different from a convex game (V,g). We show 
that a game (V, h) has a considerably different structure from that of a convex 
game. Let us review some structural properties of a convex game. 

Theorem 8. [2,8] For a convex game {V^g), CORE(g) is always nonempty. 

For any permutation tt G Un, the n -minimal vector Zt^ € belongs to CORE{g) 

and is given by ZT,{7r{i)) = 5({7r(l), 7 t( 2), . . . ,7r(f)}) -g({7r(l), 7 t( 2), . . . ,7r(i-l)}) 
for i = 1,2, ... ,n. Moreover, the set of all extreme points of CORE{g) is given 
by the set of n -minimal vectors 2 ^, tt S 7T„. □ 

We denote by ifg the mean vector of all 7r-minimal vectors Zt^, tt G Fin, for a 
convex game (V,g). 

Theorem 9. [7] For a convex game (V,g), the Shapley value <pg G 3?^ is given 

by (j)g = fjg. □ 

As to computing the Shapley value, we easily observe the following in- 
tractability. 

Lemma 4. For a convex game (V,g), there is no algorithm that computes the 
Shapley value (fg G 3?^^ by using less than 2” — 1 function value oracle calls, 
where n = \V\. □ 

Now let us consider the counter part of the above results in a game {V, h) 
with an intersecting supermodular and nega-modular function h. By applying 
Theorems 1 and 6 to system {V, —h), we have the next result. 

Theorem 10. For a game (V, h) with an intersecting supermodular and nega- 
modular function h : 2'^ i— > 3?+, CORE{h) is nonempty if and only if h{V) > 
^Y^ch{v) ^0^) holds, where ch(V) is the set of maximal subsets X in the lami- 
nar family X obtained from (V,—h) by algorithm LAMINAR. Moreover, the set 
of all extreme points of CORE (h) is given by the set of tt - minimal vectors 
TT G Fin- □ 

From Theorem 7, we obtain the following. 
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Theorem 11. For a game {V, h) with an intersecting supermodular and nega- 
modular function h : 2^ 5R+, assume that CORE{h) 0. Then the mean 

vector ifh of all n-minimal vectors Zt^, tt € Un, can be computed by using 0{n^) 
function value oracle calls, where n = |y|. □ 

Moreover, the mean vector ipg is no longer equal to the Shapley value (fh, as 
shown by the next lemma. 

Lemma 5. For a game (V, h) with an intersecting supermodular and nega- 
modular function h : 2^ there is no algorithm that computes the Shapley 

value 4>h G 3?^ by using less than 2” — 1 function value oracle calls, where 
n=\V\. □ 

4 Conclusion 

In this paper, we showed that, for an intersecting submodular and posi-modular 
set function / on V, its polyhedron P-{f) is described by a set of inequalities 
z{X) < f{X) such that X is in a laminar family X C 2^ . Furthermore, such 
a laminar family can be obtained combinatorially by 0(|Mp) function value 
oracle calls. This significantly reduces the complexity of finding a vector ^ in 
the polyhedron P_(/). As a result, we show that several optimization problems 
over the polyhedron have efficient combinatorial algorithms, and that the core 
and its mean vector of some cooperative game can be efficiently computed. It is 
left for the future research to widen the class of set functions to which similar 
algorithms are applicable. 
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Abstract. In optical networks with wavelength division multiplexing 
(WDM), multiple connections can share a link if they are transmitted 
on different wavelengths. We study the problem of satisfying a maxi- 
mum number of connection requests in a directed tree network if only a 
limited number W of wavelengths are available. In optical networks with- 
out wavelength converters this is the maximum path coloring (MaxPC) 
problem, in networks with full wavelength conversion this is the max- 
imum path packing (MaxPP) problem. MaxPC and MaxPP are shown 
to be polynomial-time solvable to optimality if the tree has height one 
or if both W and the degree of the tree are bounded by a constant. 
If either W or the degree of the tree is not bounded by a constant, 
MaxPC and MaxPP are proved MV-haiA. Polynomial-time approxima- 
tion algorithms with performance ratio 5/3 -I- e for arbitrarily small e 
are presented for the case W = 1, in which MaxPC and MaxPP are 
equivalent. For arbitrary W , a 2- approximation for MaxPP in arbitrary 
trees, a 1.58-approximation for MaxPC in trees of bounded degree, and 
a 2.22-approximation for MaxPC in arbitrary trees are obtained. 



1 Introduction 

All-optical communication networks are the technology of choice for satisfying 
the ever-growing demands for telecommunication bandwidth. Data is transmit- 
ted through optical fiber at the rate of gigabits-per-second, and WDM 
(wavelength-division multiplexing) allows several connections to use a link simul- 
taneously if the signals are transmitted on different wavelengths. In all-optical 
networks with switches without wavelength conversion capabilities, a connection 
must use the same wavelength on the whole path from transmitter to receiver. 
If W wavelengths are available, a set of connections can be established if each 
connection is assigned a transmitter-receiver path and one of the W wavelengths 
such that connections sharing a directed link receive different wavelengths. An- 
other model of optical networks employs switches with wavelength conversion. A 
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connection can use different wavelengths on different segments of its transmitter- 
receiver path. If all network switches have full wavelength conversion capabilities, 
a set of connections can be established if they are assigned transmitter-receiver 
paths such that no directed link is used by more connections than the number 
of available wavelengths. 

As the number of distinct available wavelengths is small in practice, it is 
not always the case that all requests in a given set of connection requests can 
be established simultaneously under the constraints mentioned above. In such 
a scenario, the network provider might decide to reject some requests and to 
maximize the number of accepted requests. We investigate the complexity and 
approximability of this optimization problem for the case that the topology of the 
network is that of a directed tree, i.e., the graph obtained from an undirected tree 
by replacing each undirected edge by two directed edges with opposite directions. 



1.1 Preliminaries 

A connection request in a directed tree T is given by a sender-receiver pair (m, w) 
and corresponds to the directed path from rt to in T. We will refer to requests 
as paths in the remainder of this paper. Two paths intersect if they share a 
directed edge of T. For a given set P of paths, the load L{e) of a directed edge e 
of T is the number of paths in P using edge e, and L denotes the maximum load 
among all edges of T. A IF-coloring of a given set of paths is an assignment of 
colors (wavelengths) to the paths using at most W colors such that intersecting 
paths receive different colors. 

For given directed tree T, set P of paths in T, and number W of available 
colors, the maximum path coloring problem (MaxPC) is to compute a subset 
P' C P and a VF-coloring of P' such that \P'\ is maximized, and the maximum 
path packing problem (MaxPP) is to compute a subset P' C P with maximum 
load W such that |P'| is maximized. For a given instance of MaxPC or MaxPP, 
we denote by P* an arbitrary optimum solution. MaxPC models optical net- 
works without wavelength converters, while MaxPP models the availability of 
full wavelength conversion. 

By A we denote the maximum outdegree of all nodes in the given directed 
tree T. We assume that an arbitrary node r of T has been designated the root 
of T. For V ^ r, p(v) denotes the parent of v. The level of a node is its distance 
(number of edges) from r. For a pair (u, w) of nodes in T, we denote by lca(w, w) 
the unique least common ancestor (lea) of u and w, i.e., the node with smallest 
level among all nodes on the path from u to w. The one or two edges on a path 
(u,w) that are incident to lca(u,w) are called the top edges of the path (u,w). 
A subtree of T contains a path (u, w) if lca(u, w) is a node of the subtree. 

A polynomial-time algorithm is a p-approximation algorithm for MaxPC 
or MaxPP if it always outputs a set P' whose cardinality is at least a (1/p)- 
fraction of the cardinality of an optimum solution. An algorithm that computes 
an optimum solution in polynomial time is called an exact algorithm. 
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1.2 Related Work 

Previous work has focused on the path coloring problem, where the goal is to as- 
sign wavelengths to all given connection requests while minimizing the number 
of different wavelengths used. For undirected trees, a (3/2)-approximation algo- 
rithm was given in [10] and improved to an asymptotic 1.1-approximation in [2]. 
For directed trees, the best known algorithm colors a given set of directed paths 
with maximum load L using at most [(5/3)L] colors [8, 5, 7, 6]. While the path 
coloring problem is relevant when a provider designs a network in order to meet 
the given demands or when the network has enough capacity for satisfying all 
given requests, MaxPC and MaxPP apply to the case where an existing network 
has insufficient capacity and the goal is to maximize the number of accepted 
requests. 

In chain networks, MaxPC and MaxPP are equivalent and can both be solved 
optimally in polynomial time by finding a maximum IF-colorable subgraph in 
the conflict graph, which is an interval graph in this case [12]. For undirected 
trees, the exact algorithm for integral multicommodity flow with unit edge capac- 
ities from [3] (see below) gives an exact algorithm for MaxPC and MaxPP with 
W = 1, and using the reduction from Sect. 4 a 1.58-approximation for MaxPC 
with arbitrary W is obtained. The same results can be derived for ring net- 
works [11]. For ring networks with predetermined routing of the given requests, 
a 3/2-approximation algorithm for MaxPC was obtained in [9]. A variant of the 
MaxPC problem is considered in [1]; see Sect. 4 for more details. 

MaxPP is closely related to the integral multicommodity flow problem. Inte- 
gral multicommodity flow and multicut have been studied for undirected trees 
in [3]. For a given set of source-sink pairs (commodities) in an undirected tree 
with edge capacities, the integral multicommodity flow problem is to maximize 
the sum of the flows of the commodities constrained by the given edge capacities. 
Exact algorithms for integral multicommodity flow are obtained in [3] for trees of 
height one with arbitrary edge capacities and for arbitrary trees with unit edge 
capacities. For trees with edge capacities 1 or 2 the problem is proved A/”7^-hard 
and MAX SNP-hard. For trees with arbitrary edge capacities, a 2-approximation 
algorithm is given. The main differences between the multicommodity flow prob- 
lem considered in [3] and the MaxPP problem studied in the present paper are 
that we investigate directed instead of undirected trees and that no commodity 
can have a flow greater than 1 in our setting. 



1.3 Results 

We determine the complexity of MaxPP and MaxPC in directed trees under 
various restrictions and give approximation algorithms with small constant ap- 
proximation ratios for the variants that are A/"7^-hard. We are not aware of any 
previous results regarding MaxPP and MaxPC in directed trees. Our complexity 
results are listed in this section without giving proofs; details can be found in 
the full paper available from the authors. 
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MaxPP and MaxPC are equivalent if the given tree has height one and can 
be solved in polynomial time using an algorithm for capacitated 6-matching 
[4, pp. 257-259]. This algorithm extends to the weighted version of MaxPC 
and MaxPP, where each path p has associated benefit h{p) and the goal is to 
maximize the total benefit of accepted paths. MaxPP and MaxPC can also 
be solved optimally in polynomial time using a bottom-up computation if the 
maximum degree A of the given tree network and the number W of available 
wavelengths are bounded by a constant. This algorithm extends to the weighted 
version of MaxPC and MaxPP as well, and also to variants where the set of 
available wavelengths can vary from link to link or where wavelength converters 
with limited conversion are allowed. (Furthermore, variants of the algorithm 
give exact algorithms for integral multicommodity flow in directed or undirected 
trees of bounded degree, if the edge capacities are bounded by a constant.) If 
either A or W can be arbitrarily large, MaxPP and MaxPC become AfT^-hard. 
More precisely, both problems are AfT^-hard for IT = 1 and arbitrary degree, 
and for arbitrary W and degree bounded by 3 (i.e., binary trees). 

For MaxPP with arbitrary W, we adapt the algorithm from [3] and obtain 
a 2-approximation (Sect. 2). If IT = 1, i.e., only one wavelength is available, then 
MaxPC and MaxPP are equivalent to finding a maximum cardinality subset of 
edge-disjoint paths, and we give, as our main result, a family of polynomial-time 
approximation algorithms with approximation ratio 5/3 -I- e for this case, where e 
can be chosen arbitrarily small (Sect. 3). For MaxPC with arbitrary IT, we obtain 
a 2.22-approximation for trees of arbitrary degree and a 1.58-approximation for 
trees whose degree is bounded by a constant (Sect. 4). 

2 Approximating MaxPP 

The algorithm is as follows. Initially, set P' = %. Then process all nodes of the 
tree in order of non-increasing levels. When processing node v, consider the paths 
whose lea is v in arbitrary order. Insert each such path in P' if this does not 
increase the maximum load of P' above IT. In the end, output P'. (Note that 
this algorithm for MaxPP works also if the number of available wavelengths is 
different on different links.) 

Theorem 1. The algorithm is a 2 -approximation algorithm for MaxPP, i.e., it 
outputs a subset P' of P with maximum load at most W and with cardinality at 
least |P*|/2. 

Proof. First, we observe that the approximation algorithm from [3] works also 
for directed trees. The only further difference between the multicommodity flow 
problem in [3] and the MaxPP problem is that, with the MaxPP problem, no 
commodity can have flow greater than 1. But our greedy algorithm for MaxPP 
in a directed tree T with edge capacity IT behaves like the algorithm from [3] 
in a slightly extended tree T': for each path (commodity) p from a node u to 
a node w, add two new nodes Up and Wp, add two unit capacity edges (up,u) 
and {w,Wp), and replace p by a path from Up to Wp. The multicommodity flow 
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problem in the resulting tree T' is equivalent to the MaxPP problem in the 
original tree, and our greedy algorithm produces the same solution as the 2- 
approximation algorithm from [3] on this instance. □ 

3 A Family of Approximation Algorithms for W = 1 

The algorithm from the previous section achieves approximation ratio 2 also for 
W = 1. The main idea that leads to an improvement in this special case is to 
consider all paths with the same lea simultaneously instead of one by one. This 
way we obtain, for any fixed e > 0, a polynomial-time (5/3 -I- e)-approximation 
algorithm. 

Let Py denote the subset of all paths {u, w) G P with lca(it, w) = v that do 
not intersect any of the paths that have been accepted by the algorithm at a 
previous node and that do not use any edges that have been reserved or fixed 
by the algorithm (see below) . We assume without loss of generality that we have 
u V w for all paths (u, w) G Py. 

During a first pass, the algorithm processes the nodes of T in order of non- 
increasing levels. When the algorithm processes node v, it tries to determine for 
the paths in Py whether they should be included in P' (these paths are called 
accepted) or not (these paths are called rejected). But the algorithm can not 
always make this decision right away. In some cases it leaves some paths in an 
intermediate state and resolves them later on. The possibilities for paths in such 
intermediate states are undetermined paths, groups of deferred paths, groups of 
exclusive paths, and groups of 2-exclusive paths. We refer to undetermined paths, 
groups of exclusive paths, and groups of 2-exchisive paths (but not groups of 
deferred paths) as unresolved paths. 

If all paths in Py use the same two top edges, accepting one of them might 
cause the algorithm to miss the chance of accepting two paths with an lea of 
smaller level later on. Instead, the algorithm picks one of the paths in Py and 
makes it an undetermined path in this case. 

Sometimes the algorithm decides to accept one of several intersecting paths, 
but it defers the decision which one of them to accept. The intersecting paths are 
called a group of deferred paths (see Fig. 1), and some edges (indicated by dotted 
arrows in Fig. 1) are marked as reserved. The motivation for introducing groups 
of deferred paths is as follows: first, the reserved edges block at most one path 
with lea of smaller level that could be accepted in an optimum solution; second, 
no matter which paths with lea of smaller level not intersecting a reserved edge 
are accepted by the algorithm later on, there is still at least one deferred path 
that can be accepted in a second pass that proceeds top-down. 

A group of exclusive paths is sketched in Fig. 2(a). Such a group consists of 
a lower path p and a higher path q with lea of smaller level that intersects p. At 
most one of the two paths can be accepted, but the algorithm can not afford to 
pick the wrong one. It only marks the top edge of p that is intersected by q as 
fixed (indicated by a dotted arrow in Fig. 2). Groups of exclusive paths have the 
following property. 
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Fig. 1. Possible configurations of deferred paths 







Fig. 2. (a) Possible configuration of exclusive paths; (b) situation in which both 
exclusive paths are blocked; (c) and (d) groups of 2-excliisive paths 



Property (E): As long as at most one path touching v but not using the fixed 
edge is accepted at a later node, either p or q can still be accepted. Only when 
two paths touching v are accepted at a later node, they may block both p and q 
from being accepted (see Fig. 2(b)). 

The last types of unresolved paths are sketched in Fig. 2(c) and (d). These 
groups of 2-exclusive paths consist of a set of four paths at most two of which 
can be accepted and have the following property. 

Property (2E): If at most one path touching v but not using a fixed edge is 
accepted at a later node, two paths from the group of 2-exclusive paths can still 
be accepted. If two paths touching v but not using a fixed edge are accepted at 
a later node, at least one path from the group of 2-exclusive paths can still be 
accepted. 

When the algorithm has finished processing a node v, the subtree rooted at v 
will contain at most one of the following: one undetermined path, or one group of 
exclusive paths, or one group of 2-exclusive paths. All other paths in the subtree 
are accepted, rejected, or member of a group of deferred paths. 
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3.1 Invariants 

In the next subsection we will sketch how the algorithm proceeds during the first 
pass. At the same time, we will show that the approximation ratio achieved by 
the algorithm is 5/3 + e. For establishing this, it can be proved by induction that 
the following invariant can be maintained. This invariant holds before the first 
node of T is processed, and it holds again each time an additional node of T has 
been processed. 

Let A be the set of all paths that have already been accepted by the algorithm. 
Let F be the set of all paths in P whose lea has not yet been processed and which 
are not blocked by any of the accepted paths, by reserved edges of deferred 
paths, or by fixed edges. Let d be the number of groups of deferred paths that 
are contained in processed subtrees. Let U be the set of all undetermined paths. 
Let X be the union of all groups of exclusive paths and groups of 2-exclusive 
paths. Then there is a set O C F U U U X of edge-disjoint paths satisfying the 
following conditions: 

Condition 1: |P*| — \0\ < (5/3-1- e)(|A| -|- d) 

Condition 2: For every group of exclusive paths, O contains one path from 
that group; for every group of 2-exclusive paths, O contains two paths from that 
group. 

O represents a set of paths that could still be accepted by the algorithm and 
that has the following property: if the algorithm accepts at least a l/(5/3 -I- e)- 
fraction of the paths in O (in addition to the paths it has already accepted), its 
output is a (5/3-1- e)-approximation of the optimum solution. 

The invariant is satisfied initially with A = d = 0, F = P, U = X = 
and O = P* . In order to prove that the invariant can be maintained, it suffices 
to show how the set O that establishes the invariant before node v is processed 
can be manipulated so as to satisfy the invariant also after node v is processed. 
In particular, some paths must be replaced in O or removed from O in order 
to satisfy O C F U U U X and to keep O a set of edge-disjoint paths after v is 
processed, and it must be shown that the number of paths removed from O is at 
most (5/3-|-e)(a„-|-(i„) if the algorithm accepts additional paths and creates dy 
new groups of deferred paths while processing v (thus satisfying Condition 1). 
Condition 2 must only be considered explicitly when a new group of exclusive 
paths or group of 2-exclusive paths is created. 

If the invariant is satisfied after the root node is processed, we have F = 0, 
O C U U X , and |P*| — |0| < (5/3 -I- e)(|A| -|- d). By accepting the undetermined 
path (if any), accepting an arbitrary path from the group of exclusive paths 
(if any), and accepting two arbitrary edge-disjoint paths from the group of 2- 
exclusive paths (if any), the algorithm accepts \0\ additional paths, and the 
resulting set A satisfies |P*| < (5/3-|-£)(|A|-|-d). In the second pass, the algorithm 
processes the nodes of the tree in order of non-decreasing levels and accepts g 
additional paths at each node v that is the lea of g groups of deferred paths. Then 
the algorithm outputs the set of all accepted paths. As the algorithm accepts d 
additional paths, one from each group of deferred paths, in the second pass, this 
establishes our main theorem. 
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Theorem 2. For every fixed e > 0, there is a polynomial-time approximation 
algorithm for MaxPC and MaxPP with one available color (wavelength) having 
approximation ratio 5/3 + e. 



3.2 The First Pass 

Recall that C P is the set of all paths with lea v that do not intersect any 
previously accepted path nor any fixed or reserved edge. Let C/„ be the set of 
undetermined paths contained in subtrees rooted at children of v. Let Xy be the 
union of groups of exclusive paths and groups of 2-exclusive paths contained in 
subtrees rooted at children of v. We sketch how the algorithm processes node v 
and determines which of the paths in PyUUyU Xy should be accepted, rejected, 
deferred, or left in an unresolved state. 

Note that for a given set of paths with lea v the problem of determining 
a maximum subset of edge-disjoint paths is equivalent to finding a maximum 
matching in a bipartite graph [8] and can thus be done in polynomial time. 
Furthermore, we use the following property of bipartite graphs: for s = 1 or 
s = 2, the fact that a maximum matching in a bipartite graph G has cardinality s 
implies that there are s vertices in G such that every edge is incident to at least 
one of these s vertices. (The property holds for arbitrary s and is known as the 
Konig theorem.) 

Let k be the number of children of v that have an undetermined path in their 
subtree, let i be the number of children that have a group of exclusive paths, 
and let m be the number of children that have a group of 2-exclusive paths. We 
use the expression subtrees with exclusive paths to refer to all subtrees of v with 
either a group of exclusive paths or with a group of 2-exclusive paths. Before v 
is processed, the invariant implies that there is a set O C F U C/ U X satisfying 
Condition 1 and 2. In each single case of the following case analysis, it must be 
shown how a set O' can be obtained from O such that Condition 1 and 2 are 
satisfied for O' after v is processed. 

Case 1: k-\-£-\-m < max{3, 2/e}. The algorithm can try out all combinations 
of accepting or rejecting unresolved paths in the subtrees rooted at children of v: 
for undetermined paths there are two possibilities (accepting or rejecting the 
path), for groups of exclusive paths there are two possibilities (accepting the 
lower path or accepting the higher path), and for groups of 2-exclusive paths 
there are at most four relevant possibilities of accepting two edge-disjoint paths 
of the group (see Fig. 2). Hence, the number of possible combinations is bounded 
from above by = 0(1). 

For each of these combinations, the algorithm can compute a maximum num- 
ber of edge-disjoint paths in Py not intersecting any of the (tentatively) accepted 
paths from Uy U Xy. Let s be the maximum, over all combinations, of the num- 
ber of tentatively accepted paths from Uy U Xy plus the number of maximum 
edge-disjoint paths in P„. If s = 0, we have k = i = m = 0 and Py = %, and the 
algorithm proceeds with the next node. Otherwise, we distinguish the following 



cases. 
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If s > 3, the algorithm accepts the s paths and rejects all other paths 
from Py Lilly U Xy . As s is the maximum number of edge-disjoint paths in Py U 
Uy U Xy, O can contain at most s paths from U U Xy. Furthermore, O 
can contain at most two paths from F using the edges (v,p(v)) or (p(v),v), and 
these are the only two further paths in O that could possibly be blocked by the s 
paths accepted by the algorithm. Hence, a valid set O' can be obtained from O 
by deleting at most s + 2 paths. As s-|-2 < (5/3)s, the invariant is maintained. In 
the cases s = 1 and s = 2, a huge number of subcases for k, I and m such that 
fc-|-^-l-2m < 1 resp. k + £ + 2m < 2 is distinguished. For each of these subcases, a 
number of configurations of paths in Py with respect to the paths in Xy U Uy are 
considered. Each such subcase and configuration can be recognized in polyno- 
mial time, and it can be shown that one of the following actions can satisfy the 
invariant: (1) The algorithm creates a new undetermined path or a new group of 
exclusive paths. A valid set O' can be derived from O by replacing or inserting 
one path, if necessary. (2) The algorithm creates a new group of deferred paths. 
In this case, a valid set O' can be derived from O by deleting at most one path. 
(3) The algorithm accepts paths and/or creates groups of deferred paths such 
that ay + dy = 2. In this case, a valid set O' can be derived from O by deleting 
at most 3 paths. (4) The algorithm creates a group of 2-exclusive paths from 
some paths in Py U Xy U Uy . In this case, a valid set O' can be derived from O by 
replacing or inserting at most two paths. All details are omitted in this extended 
abstract due to space limitations. 

Case 2: k + I + m > max{3,2/e}. The algorithm calculates four candidate 
sets Si,. .. ,84 of edge-disjoint paths from PyLiUyUXy and chooses the largest of 
them. For obtaining S 2 and S 4 , we employ a method of removing r paths from 
an arbitrary set S of edge-disjoint paths in Py such that i + 2m exclusive paths 
from Xy can be accepted in addition to the paths remaining in S. The details of 
the method and a proof that r < {\S\+ £ + m)/i are omitted. S\ is obtained as 
the union of all k undetermined paths and a maximum number si of edge-disjoint 
paths from Py not intersecting any undetermined path, and as many additional 
edge-disjoint paths from the £ + m subtrees with exclusive paths as possible. 
We have |S'i| > k + si + m, because Si contains k undetermined paths and at 
least m paths from groups of 2-exclusive paths in Xy due to Property (2E). S 2 
is obtained from Si by removing r of the si paths in from Si until £+ 2 m 

exclusive paths can be accepted. S 2 contains £ + 2 m exclusive paths, and only 
r S (si+£+m)/‘i of the si paths in S'inP„ were removed to obtain S 2 . As S 2 still 
contains the k undetermined paths, we have |S'2| > k+m+{ 2 /'i){si+£+m). S 3 is 
obtained by taking a maximum number S3 of edge-disjoint paths from Py and as 
many additional edge-disjoint paths from the £+m subtrees with exclusive paths 
and the k subtrees with undetermined paths as possible. We have 1531 > S3 -km, 
because S 3 contains at least m paths from groups of 2-exclusive paths in Xy due 
to Property (2E). ^4 is obtained from S 3 by removing r of the S3 paths in 838 ] Py 
from S 3 until £ + 2m exclusive paths can be accepted. Since r < (53 -I- ^-k m)/3, 
we have |54| > m + (2/3)(s3 + £ + m). We claim that the number of paths 
in = O n {Py UUyLl Xy) is at most si -k (S3 — si)/2 -k fc -k £-k 2m. With this 



188 



Thomas Erlebach and Klaus Jansen 



upper bound on 1 | and the lower bounds on the cardinalities of the four sets Si , 
it can be proved that at least one of the sets Si satisfies |0«| + 2 < (5/3 + e)|S'i|. 
At most |0„| + 2 paths must be removed from O in order to obtain a valid set O' . 

4 Approximating MaxPC for Arbitrary W 

In order to obtain an approximation algorithm for MaxPC with arbitrary num- 
ber W of available wavelengths from an algorithm for bP = 1, we employ a 
technique from [1]. The approximation algorithm A for arbitrary number W of 
wavelengths is obtained from an approximation algorithm A\ for one wavelength 
by running W copies of Ai and giving as input to the i-th copy the set of paths 
that have not been accepted by the first i — 1 copies of Ai. The output of A is 
the union of the W sets of paths output by the copies of Ai, and the paths in 
the i-th set are assigned colored i. 

In [I] it is shown that the algorithm A obtained in this way has approximation 
ratio at most p+ \ if Ai has approximation ratio p, even if different wavelengths 
are associated with different network topologies. There the technique is used 
to obtain randomized on-line algorithms with logarithmic competitive ratio for 
networks shaped as rooted forests. For identical networks, which we have in 
our application, the approximation ratio achieved by A can even be bounded by 
1/(1 — (1 — 1/ (pbF))'^), which is smaller than 1 / (1 — e“^/^) for all W. This bound 
is mentioned in a preliminary draft of the journal version of [1], which was kindly 
supplied to the authors by Adi Rosen. The bound can be proved easily by using 
the fact that, if A has selected pk paths after running k copies of Ai, there is still a 
set of at least {\P*\—pk)/W edge-disjoint paths among the remaining paths (this 
follows from a pigeonhole argument), and the next copy of Ai accepts at least 
a (l/p)-fraction of them. As we have obtained an exact algorithm for MaxPC 
with bF = 1 in bounded degree trees and (5/3 -I- e)-approximation algorithms 
for MaxPC with bF = 1 in arbitrary trees, by employing the above technique we 
obtain approximation algorithms with ratio 1/(1 — 1/e) « 1.58 for arbitrary bF 
in bounded degree trees and with ratio « 2.22 for arbitrary bF in arbitrary trees. 



5 Open Problems 

It is an interesting open question whether the approximation ratios of our al- 
gorithms for directed trees can be improved. Other promising directions for 
future research include approximation algorithms for the weighted MaxPP and 
MaxPC problems in arbitrary trees and for the MaxPC problem with different 
sets of available wavelengths on different links. In addition, it would be very 
interesting to see whether techniques we used in the (5/3 -I- e)-approximation 
for MaxPC and MaxPP with bF = 1 can lead to improved approximation al- 
gorithms for the integral multicommodity flow problem in trees in general or 
for special cases thereof; for this problem, the best known approximation is still 
the 2-approximation from [3] . 
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Abstract. In this paper we study the problem of finding the k largest 
elements of n distinct real numbers. We give a pure combinatorial ar- 
gument to prove that n + (k — 1) logn + 0(1) queries are necessary for 
any deterministic algorithm using parity tests only. This settles an open 
problem hrst raised by Yao [11]. We also present a randomized algorithm 
with error probability 0(l/n'^) using only 0(log^ n-|-fc log n) parity tests, 
where c > 1 is any fixed constant. 



1 Introduction 

This paper is concerned with the problem of finding the k largest elements of n 
distinct real numbers. This problem, as well as other variants of the selection 
problem, has been studied extensively. Based on the pairwise comparison model 
(i.e., each query takes the form “xi — Xj : 0” where Xi,Xj are distinct inputs), it 
is known that n -I- (fc — 1) log n + 0(1) comparisons^ are necessary and sufficient 
to find the k largest elements (Hyafil [4] , Kirkpatrick [.5] , Kislitsyn [6] , Pratt and 
Yao [7]). To go further, people have studied more general queries. Fussenegger 
and Gabow [2] showed that even when a query is any linear function of the 
inputs, the same lower bound applies. Yao [11] is the first to go beyond linear 
function and proved that the same lower bound holds for median test, which 
is of the form “(si — Xj){xi — Xk) ■ 0”. Fleischer [1] extended this lower bound 
result to any arbitrary products of linear functions. 

In this paper we focus on parity test, which is of the form “n(i ■ 0” 

where C is any collection of pairs of input positions. The study of parity test 
on selection problems is not new in the literature; it is actually the only kind 
of polynomial test people know how to make use of in deriving better upper 
bound results for selection problems (see e.g., [8,9]). Fleischer’s result [1], of 
course, implies the above-mentioned lower bound for parity test. Nevertheless, 
the work of Yao [11] and Fleischer [1] for non-linear test is not straightforward 
(though thought-provoking); the key lemmas are based on geometric arguments, 
reasoning the relationship of the regions in the n-dimensional space defined by 
the queries. An open problem, first raised in [11], is to find a pure combina- 
torial proof of the lower bound for median test. This paper presents a simple 
combinatorial proof for parity test, settling this open problem. 

^ Throughout the paper, the base of the logarithm is 2. 
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The second contribution of this paper is an efficient randomized algorithm 
for finding the k largest elements. We show that, for any constant c > 1, there is 
a randomized algorithm with error probability 0(l/n'^) for finding the k largest 
elements using only 0(log^n + fclogn) parity tests. We remark that random- 
ization cannot help if only pairwise comparisons are allowed. More precisely, we 
show that the error probability of any randomized algorithm using o(n) pairwise 
comparisons is at least 1 — o(l). In other words, such an algorithm returns an 
incorrect answer almost surely. 



2 A Lower Bound for Selecting the k Largest Elements 



In this section we show that if only parity tests are allowed, any deterministic 
algorithm for finding the k largest elements of n distinct numbers has to make 
at least n — k + [logn(n — 1) • • • (n — fc -|- 1)] tests in the worst case. An ordered 
pair {i,j) is said to be an index pair if i,j are distinct integers in {1, 2, • • • , n}. 
A parity test takes the form of ~ where C is a set of index 

pairs. 



2.1 Model of Computation 

We model a selection algorithm by a decision tree. A decision tree is a binary tree 
in which every internal node is labeled with a test over the inputs (e.g., a parity 
test “ri(i ~ every leaf is labeled with an output (say, the 

indices of the largest elements). Every internal node has exactly two outgoing 
edges labeled with “> 0” and “< 0”. Given any input X = {xi,X 2 t--- ,a;„}, 
the execution of the selection algorithm corresponds to tracing a path from the 
root of the decision tree to a leaf. More precisely, starting from the root, we 
repeatedly perform the test specified in a node and follow the outgoing edge 
whose label matches the result of the test. The execution terminates when a leaf 
is reached. The output for the input X is the label of that leaf. The length of 
the longest path from the root of a decision tree to any of its leaf represents the 
number of tests the selection algorithm performs in the worst case. 



2.2 A Counting Argument of Fussenegger and Gabow 

Most of the existing lower bound results [1,2,3,11] on the problem of selecting 
the k largest elements make use of a counting argument introduced by Fusseneg- 
ger and Gabow [2]: 

Given a decision tree T for finding the k largest elements of n numbers, 
it is always possible to construct n{n — \) ■ ■ ■ {n — k + 2) subtrees of T 
which do not share any leaves and each of which defines an algorithm 
for finding the maximum oi n — k + 1 numbers. 
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Assume that T only uses tests of type Q. If one can show that any decision 
tree that uses tests of type Q to find the maximum of m numbers must have at 
least 2"*“^ leaves, then T must have 2"'“^n(n — 1) ■ ■ ■ {n — k + 2) leaves, and its 
height must be at least n — k + [log n{n — 1) • • • (n — fc + 2)] . The latter is also 
a lower bound on the number of tests made by T in the worst case. 

In the next subsection, we give a proof of the following lemma. 

Lemma 1. If only parity tests are allowed, then any decision tree for finding 
the maximum element of n distinct numbers must have at least 2"“^ leaves. 

Lemma 1 together the above-mentioned counting argument give us the fol- 
lowing theorem. 

Theorem 1. If only parity tests are allowed, then any algorithm for finding 
the k largest elements of n distinct numbers must make at least n — k + \ 
logn(n — 1) • • • (n — fc -I- 2)] = n -I- (fc — 1) logn -I- 0(1) tests in the worst case. 

2.3 A Combinatorial Lower Bound Proof 

Let T be any decision tree for finding the maximum element of n distinct real 
numbers using parity tests only. We assume that for any leaf of T, there exists at 
least one input X = (xi,X 2 , - ■ ■ ,Xn) such that the execution for X terminates 
at this leaf. In this subsection we show that every path from the root to a leaf 
of T has at least n — 1 internal nodes. Then Lemma 1 follows. 

Pick an arbitrary leaf a of T . Let h be the number of internal nodes along 
the path between the root and cr. Let Ci, C 2 , • • • ,Ch be the sets of index pairs 
corresponding to the tests at these internal nodes. Denote Q as the collection 
{Cl, C 2 , • • • ,Ch}. Without loss of generality, we make the following assumptions: 

• O’ is labeled with the output 1, i.e., the first input is the maximum. 

• All the edges along the path from the root to cr are labeled with “> 0” . That 
is, if the execution for an input X terminates at a, then n(..,) ^c,{xi-Xj) > 0 
for all Cl G Q. 

Suppose, by way of contradiction, that Q contains h < n — 1 sets of index 
pairs. Let X = {xi,X 2 ,‘‘- ,Xn) be an input whose execution terminates at cr. 
That is, xi is the maximum element in X. We are going to construct another 
input Z = (zi, Z 2 , ■ ■ ■ , Zn) from X such that the execution for Z also terminates 
at cr, but zi is not the maximum element in Z. Thus, we obtain a contradiction. 

To ease our discussion, we further assume that Ci , C 2 , • • • ,Ch are each term- 
wise positive with respect to X, i.e., for every index pair (i,j) € C/, (xi — Xj) > 0. 
Below we explain why this assumption is valid. Consider any Ci G Q. Since 
Y\{^i j)(zCi{xi — Xj) > 0, there are an even number (possibly zero) of index pairs 
(z, j) G Cl such that {xi — Xj) < 0. Let us replace all such index pairs (i,j) with 
(j, i). Then Ci results in a test which produces the same outcome for all possible 
inputs and which is term- wise positive with respect to X . 

Roughly speaking, if h (i.e. |5|) is less than n— 1, we can show that some xfs 
are not tested thoroughly by Q. More specifically, we say that U C (1,2,... ,n} 
is a set of obscure positions if U satisfies the following properties: 
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• U includes 1 and at least one other position in {2, • • • ,n}. 

• For each Ci G G, there is an even number of index pairs {i,j) in Ci where i 
is any index and j is in U (i.e. \{{i, j) G Ci \ j G U}\ is even). 

Z will be constructed in such a way that the maximum element can be placed 
in any position in U other than the first position. 

Before proving the existence of such a set U , we need one more notation. 
For any Ci G G and j G {1,2, • • • ,n}, define Nij to be the number of index 
pairs (*,j) in Ci, where i is any index in {1,2,- • • ,n}. Obviously, u ~ 



Lemma 2. For any Ci G G, Nn = 0. 

Proof. Since every Ci G G is term-wise positive with respect to X and xi is 
the maximum element of X, no Ci contains an index pair (f, 1) for any i G 
{1,2,- • • ,n}. Thus, Nil = 0. 

Lemma 3. Assume that |^| = h < n — 2. There exists a nonempty set U\ C 
{2, 3, • • • ,n} such that for all Ci G G, sven. 

Proof. There are 2”“^ — 1 distinct nonempty subsets of (2, 3, • • • , n}. Let V be 
one of such subsets. For each I G {I,-- - ,h}, J2jev either odd or even. 

We consider the parity of J2jev alH as a vector of length h. Since there 

are h < n — 2 Cj’s, there are at most 2"“^ distinct vectors. Thus, there must 
exist two distinct subsets Vi and V 2 such that for all Ci G G, J2jeVi 
have the same parity. 

Let U\ be the symmetric difference of Vi and V 2 (i.e. {V\ U V 2 ) — {V\ n V 2 )). 
f7i is non-empty. Moreover, for any Ci G G, (X]jg£/i -^b) even. 

By Lemmas 2 and 3, we can conclude that f7 = {1} U is a set of obscure 
positions. The next lemma shows how to construct an input Z based on such a 
set U , which contradicts that inputs terminates at cr should have their maximum 
element in the first position. 

Lemma 4. If h < n — 2, then there is an input Z = (zi, Z 2 , - ■ ■ , Zn) such that zi 
is not the maximum element and, for any Ci G G, Y\{i ~ ^ 

Proof. Let U be the set of obscure positions defined above. Recall that xi is the 
maximum element of X . Define Z follows. For any z G {1, 2, • • • , n}. 



Consider any C/ in G. Ci is term-wise positive with respect to X. Thus, for 
any index-pair (z,j) G Ci , {zi — zj) < 0 if and only if j G U. By definition 
of U, C has an even number of index pairs (i,j) where j G U. We conclude that 



We claim that zi is not the maximum element in Z . Since |f7| > 2, there ex- 
ists jo G U and jo 7^ 1- By definition of Z, Zj„ = —Xjg and Zi = —Xi Therefore, zi 
is smaller than Zjg and Zi is not the maximum element. 



\{{l,j)GCi\jGU}\. 




—Xi if i G U, 

Xi — 2x I otherwise. 



n(iy)GCi(^® > 0- 
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In conclusion, if the path from the root of T to cr contains h < n — 1 parity 
tests, there exists an input Z of which the maximum element is not in the first 
position, yet the execution of T for Z terminates at a. A contradiction occurs. 



3 A Randomized Algorithm 

In this section we present a randomized algorithm with error probability 
for selecting the k largest elements of n distinct numbers X = {xi,X 2 , - ■ ■ ,Xn} 
using 0(log^ n+k log n) parity tests, where c > 1 is any fixed constant. The parity 
tests performed by our algorithm are actually very restricted, all in the form 
— a;j) : 0” where B is any set of input positions. Our work is motivated 
by Ting and Yao’s randomized algorithm [9] for finding the maximum of n 
numbers, which uses 0(log^ n) restricted parity tests and has error probability 
0(n“'^). Note that by repeating Ting and Yao’s algorithm k times, we can 
select the k largest using 0(fclog^ n) restricted parity tests, the error probability 
is 0{kn~'^). 

Our improved algorithm is called Select, which repeatedly calls a func- 
tion called FindLarger to find an element in the input larger than a given 
element. The following notation will be used throughout this section. For any 
/C - ,n}, denote Xj as the subset {xi | i S /}. 



3.1 FindLarger 

FindLarger is a function which, given any I C {1, 2, • • • , n} and s € I, finds an 
index s' G I such that Xs> > Xg- The function may fail to find such an s' and 
return 0. Details are as follows. We randomly choose a subset S of / — {s} and 
perform the restricted parity test ~ ■ 0”- ~ ^j) < 0, 

Xb must contain an odd number of elements greater than Xg. We find such an 
element using the following binary search. We randomly choose a subset of B 
with exactly [|i?|/2] elements. Denote B' as this subset and let B" = B — B' . 
We perform one more restricted parity test, Hjes' Note that either 

~ ~ than zero. Depending on whether 

~ rijGB"(2^s ~ ^j) i® 1®®® than zero, we reduce the problem of 

finding an element in Ab that is greater than Xg to the same problem for B' 
or B" , respectively. By repeating the process [log |i?|] times, we can eventually 
find an index s' with Xgi > Xg. 

It remains to consider the case when IljeB ^ Since B may contain 
no elements greater than Xg, B is simply discarded (in this case, we say B fails). 
We choose another subset B randomly and repeat the process again. We will 
give up if, in A = [clogn -F log fc] attempts, the B’s chosen all fail. In this case, 
FindLarger returns 0. 

FindLarger makes at most A-F [log n] restricted parity tests. Note that all the 
random subsets used by FindLarger are chosen uniformly and independently. 
If FindLarger does report an index s' such that Xg> > Xg, then every index in 
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the set L = {he / | xt > Xg}, by symmetry, has equal probability to be re- 
ported. The following lemma is concerned with the probability that FindLarger 
successfully reports an index s' . 

Lemma 5. If Xs is the maximum element in Xj, then FindLarger returns 0. 
Otherwise, the function returns an index s' £ L with probability 1 - 

Proof. If Xs is the maximum element in Xj, then ~ ^ 

subset B of I. Thus, FindLarger must return 0. Assume that Xs is not the 
maximum element. For any subset B, the probability that B contains an even 
number of elements in L (and hence ~ ^ exactly 1/2. The 

probability that the subsets B chosen in the A attempts all fail is 1/2^. Thus, with 
probability 1 — FindLarger encounters a subset B such that < 0. 

Then, using the binary search, we can find an index s' £ B such that Xs> > Xs- 

3.2 The Selection Algorithm 

We are ready to describe the algorithm Select. The computation of Select is 
divided into k phases; the ith largest element of X is reported in Phase i. The 
major data structure kept by Select is a stack of indices, denoted S below. 
Initially, an element in {1,2, • • • ,n} is picked randomly and pushed onto S. In 
general, S stores the indices of some elements that have not yet been reported 
by Select. 

The computation involved in Phase i is as follows. Let , X(^ ,■■■ , xi-_,, be 
the elements reported in previous phases. Let /= {1, 2, • • • , n} — {£i,£ 2 ,. ■ .,4-i}- 
To find the ith largest element. Select attempts to determine the maximum 
element in Xj. The first contender is Xg, where s is the index at the top of S. 
Select invokes the function FindLarger to find out whether any element in Xj 
is greater than Xg. 

FindLarger(J, s) returns 0: From Lemma 5, with probability at least 1 - 
Xg is the maximum element in Xj. Select declares that Xg is the ith largest 
element (though it may not be the case) and pops s out from S. If S becomes 
empty, an index in /— {s} will be picked randomly and pushed onto S. Select 
then enters Phase i -F 1 . 

FindLarger(J, s) returns an s' > 0: Because Xg' > Xg, we push s' onto S. 
Then Xg' is the next contender for the ith largest element of X. 

Note that the sequence of contenders being tested in each phase is strictly 
increasing. This implies that Select will exit a phase after calling FindLarger 
at most n — 1 times. However, we want to limit the total number of calls of 
FindLarger over all phases to be O(logn). Thus, Select keeps a counter count 
for recording the total number of times FindLarger have been called. Let 77 = 
7c log n + 2k. When count exceeds the value of rj, Select gives up and simply 
returns “failure” . 

A formal description of the algorithm is given in Figure 1. Select calls 
F indLarger at most 77 times and each F indLarger makes at most A + [log n~\ 
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input: X = {xi,X 2 , ■ ■ ■ ,x„) 
output: The k largest elements of x 

/ ^ {1, 2, • • • , n}; ?7 = lOcflogn] + 2k- 
create an empty stack S\ 
randomly pick an s € /; 
push(5, s); phase <— 1; count <— 0; 
while {phase < k) and {count < rj) do { 

s <— top(S); 

s' <— FindLarger(J, s); count <— count + 1; 
if s' 7 ^ 0 then push(S', s') 
else { 

output (is); I ^ I — {s}; 
pop(5); 

if S = (j> then { randomly pick an index a in 7; push(S', z); } 
phase <— phase + 1; 

} 

} // end of while 

if count = rj then return (“failure”); 



Fig. 1. The Algorithm Select 



restricted parity tests, the total number of restricted parity tests made by the al- 
gorithm is 0(log^ n-l-A: log n) . Below, we estimate the error probability of Select. 
There are two possible causes of error: (1) At some phase (say. Phase i), Select 
determines the ith largest element of X incorrectly, and (2) Select returns 
“failure” . 

Lemma 6. Let p be the probability that at some phase, say, Phase i. Select 
determines the ith largest element of X incorreetly. Then p = 0{n~‘'). 

Proof. Consider Phase i. Recall that I denotes the indices of elements not yet 
reported. Select determines incorrectly some Xg to be the ith largest element 
only if Select receives a 0 from the function call FindLarger(7, ig), but Xg is 
not the maximum element in Xj. From Lemma 5, this occurs with probability 
1/2^ = Since there are at most k phases, we have p = 0{n~‘'). 

Next, we estimate the probability that Select returns “failure”, or equiva- 
lently, the probability that count = rj. Let m be the number of indices left in 
the stack S when Select terminates. Let A be the number of pop operations 
executed by Select. Since a pop operation is executed only after Select outputs 
some Xg, we have A < k. The number of push operations executed by Select 
is m + A. Every time after Select calls FindLarger, it performs a push and/or 
pop operation. Thus, count < m + 2A < m + 2k. 

Prob [count = * 7 ] < Prob [m + 2k> 7clogn -F 2k] 

< Prob [m > 7c log n] 
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Lemma 7. Proh[m > 7clogn] = 0{n 

Proof. Assume that when Select terminates, the sequence of indices left in 
the stack S is (so,si,... ,Sm-i), where Sm-i is at the top. Consider any j e 
{ 0 , 1 , • • • , m — 1 }. At the point just before sj is computed, let Ij denote the set of 
indices of elements not yet reported by Select. By definition, 
Im-i C Im -2 C • • • C /g. In fact. Select computes sj by calling the function 
FindLarger(/j,Sj_i). Thus, Sm-i > Sm-2 > ■ ■ ■ > sg. Recall that FindLarger is 
constructed in such a way that every index in the set Lj = {h G Ij \ Xt > } 

has equal probability to be reported as Sj. We say that sj makes a big advance 
if Xsj is larger than half or more of the elements in otherwise a small ad- 
vance. Obviously, the probability that Sj makes a small advance is at most 1 / 2 . 
Among sg,si,... ,Sm-i, there are at most [lognj indices giving big advances; 
we denote m' as the number of indices making small advances. Intuitively, the 
probability that m is large is very small because many small advances must be 
involved, yet each has probability at most 1/2. 

Prob [m > 7c log n] < Prob [m! > 7c log n — logn] 

^ / 7clogn \ 

“ \7c log n — logn/ \2/ 

^ / e-7clogn y°g" / 

- V logn ) V2j 

= ^log(e-7c)-(7c-l) 

< n“'^ (assume c > 1) 

The following theorem follows from Lemmas 6 and 7. 

Theorem 2. The algorithm Select finds the k largest elements of n distinet 
real numbers with error probability 0{n~'^). 

4 Remarks 

If only pairwise comparisons are allowed, any randomized algorithm that finds 
the maximum of n numbers using oin) comparisons must have error probabil- 
ity 1 — o(l). By Yao’s Minimax Principle [10], this follows from the following 
proposition. 

Proposition 1. Let A be any deterministic algorithm that finds the maximum 
of n numbers using h = o(n) eomparisons. For a random input X , the probability 
that A returns an incorrect answer is 1 — o(l). 

Proof. By making h comparisons, A can examine at most 2h elements of X. 
Each of the remaining n — 2h elements can be the maximum element with prob- 
ability 1/n, and A can return only one of them. Thus, the probability that an 
incorrect answer is returned is at least (n — 2h — l)/n, which is 1 — o(l) as 
h = o{n). 
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Abstract. We introduce randomized A-dimensional binary search trees 
(randomized Ad-trees), a variant of A-dimensional binary search trees 
that allows the efficient maintenance of multidimensional records for any 
sequence of insertions and deletions; and thus, is fully dynamic. We 
show that several types of associative queries are efficiently supported 
by randomized Ad-trees. In particular, a randomized Ad-tree with n 
records answers exact match queries in expected C)(logn) time. Partial 
match queries are answered in expected time, when s out 

of A attributes are specified (with 0 < f{s/K) < 1 a real valued func- 
tion of s/K). Nearest neighbor queries are answered on-line in expected 
C)(logn) time. Our randomized algorithms guarantee that their expected 
time bounds hold irrespective of the order and number of insertions and 
deletions. 



1 Introduction 

The associative retrieval problem consists of answering queries with respect to 
a file of multidimensional records. Each multidimensional record contains a A- 
tuple of values that constitute its A-dimensional key and associated data. The 
entries of a A-dimensional key are referred to as attributes. An associative query 
specifies a certain condition about the keys and requires the retrieval of records 
in the file whose keys satisfy the given condition (examples of associative queries 
are intersection queries and nearest neighbor queries) . 

Several data structures for building information retrieval systems support 
associative queries and offer different trade-offs to the type of associative queries 
for which they are efficient [6,12]. Their space requirements, worst-case and 
expected-case performance on a range of operations as well as their design make 
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them more or less suitable for the dynamic maintenance of a file. The use oi K- 
dimensional binary trees [2] (or simply i^d-trees) is a convenient device for sup- 
porting associative queries because it usually supports a large set of operations 
with relatively simple algorithms and offers reasonable compromises on time and 
space requirements. 

The expected-case analysis for ifd-trees is made under the assumption that 
the ifd-tree is built by successive insertions of records with if-dimensional keys 
independently drawn from a random source (say, uniformly distributed ran- 
dom itT-tuples in [0,1]^) [7]. We say that a i^d-tree built in this way is a 
randomly built ifd-tree. For randomly built TFd-trees of size n the expected 
time for an insertion, a deletion or an exact search of a record is O (logn) [2]. 
Intersection queries and nearest neighbor queries are also supported by this data 
structure [2,3,4,13]. The efficient expected case performance of these operations 
holds only under the assumption that the Kd-tree is randomly built. Unfortu- 
nately, this assumption does not always hold: for instance, it fails when the keys 
to be inserted are sorted with respect to one of the attributes. Moreover, an 
alternation of deletions and insertions over a randomly built Tfd-tree destroys 
randomness, in the sense that the resulting tree is no longer a randomly built 
tree. This happens even if every item of the file is equally likely to be deleted. 
After some updates the tree may need to be rebuilt to preserve the logarithmic 
expected performance. 

One possibility to overcome this problem is the use of optimized 
Aid-trees [4,13], assuming that the file of records is given a priori. Such Aid- 
trees are perfectly balanced and their logarithmic performance in dictionary 
operations is guaranteed. However, when insertions and deletions are to be per- 
formed, a reorganization of the whole tree is required. Thus, this alternative is 
not suitable unless updates occur rarely and most records in the file are known in 
advance. Another approach is to introduce explicit constraints on the balancing 
of the trees, as in dynamically balanced Aid-trees and in divided A'd-trees [8,14]. 
Each update operation checks whether a predefined balance criterion remains 
true. If thresholds for balance criteria are exceeded, then a complex reorga- 
nization of the tree is performed. Unfortunately, these methods sacrifice the 
simplicity of the standard Aid-tree algorithms. 

A third approach consists in the use of randomization to produce simple yet 
more robust algorithms. Randomization guarantees efficient expected perfor- 
mance that no longer depends on assumptions about the input distribution [10]. 
Randomization has been successfully applied to the design of dictionaries by 
Aragon and Seidel [1], Pugh [9] and Martinez and Roura [11]. 

In this paper we present randomized AT-dimensional binary search trees (ran- 
domized Kd-trees, for short), a new type of AT-dimensional binary trees that 
(1) support any sequence of update operations (insertions and deletions) while 
preserving the randomness of the tree, (2) do not demand preprocessing and 
(3) efficiently support exact match and associative queries. The main idea is 
to use randomization for performing insertions and deletions in regions of A'd- 
trees other than the leaves in such a way that the resulting tree is a randomly 



Randomized if-Dimensional Binary Search Trees 



201 



built tree. As a consequence, the expected case performance of every operation 
holds irrespective of the order of update operations since it only depends on the 
random choices made by the randomized update algorithms. A full version of 
this paper is accessible from: http://www.lsi.upc.es/~techreps/1998.html. 



2 Relaxed l^d-trees 

Multidimensional binary trees (or ATd-trees) [2] are a generalization of the well 
known binary search trees (BSTs), that handles records with keys of K at- 
tributes. In what follows and without loss of generality, we identify a record 
with its corresponding key as x = where each a;W, 1 < 

i < K, refers to the value of the i-th attribute of the key x. The domain 
D = Di X D2 X ... X Dk of the keys is AT-dimensional and each Di is a to- 
tally ordered set. 

Definition 1. A ATd-tree for a set of keys is a binary tree in which: 

1. - Each node contains a K -dimensional key and has an associated discriminant 
je{i,2,...,i^}. 

2. - For every node with key x and discriminant j , any key y in the left subtree 

satisfies y^^'^ < and any key y in the right subtree satisfies y^^'^ > . 

3. - The root node has depth 0 and discriminant 1. All nodes at depth d have 
discriminant (d mod K) 1. 

Note that, if AT = 1, then a ATd-tree is a binary search tree. As for binary search 
trees, it is assumed that each domain Di is a continuous set and therefore the 
probability of having two equal values of the z-th attribute is zero [2,7]. Thus, 
any key x to be inserted is always compatible with a tree T in the sense that, 
for any z, its z-th attribute is different from the z-th attribute of any other key 
already in T. 

The cyclic restriction of ATd-trees (Condition 3 in Definition 1) forces update 
operations to be very laborious or located at the leaves. But, as we shall show, 
in our quest for randomized i^d-trees, we add the flexibility of performing oper- 
ations in places of ATd-trees other than the leaves without major reorganization. 

With this purpose in mind, we now introduce relaxed i^d-trees. These are 
Aid-trees where Condition 3 in Definition 1 is dropped but where each node 
explicitly stores its discriminant. So, the sequence of discriminants in a path 
from the root to any leaf is arbitrary. 

Definition 2. A relaxed ATd-tree for a set of keys is a binary tree in which: 

1. - Each node contains a K -dimensional key and has associated a discriminant 

j€{l, 2,. 

2. - For every node with key x and discriminant j, any key y in the right subtree 
satisfies y^^'^ < x^^'^ and any key y in the left subtree satisfies z/^-1^ > x^^'^ . 

Again, if AT = 1, a relaxed A'd-tree is a binary search tree. Now, we say that 
a relaxed Aid-tree of size n is randomly built if it is built by n insertions where 
the keys are drawn independently from a continuous distribution (for example. 
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uniformly from [0, 1]^) and the discriminants are drawn uniformly and indepen- 
dently from {1,... ,K}. In randomly built relaxed Xd-trees, this assumption 
about the distribution of the input implies that the nIA'" possible configura- 
tions of input file and discriminant sequence are equiprobable (observe that in 
randomly built Aid-trees the assumption implies that all n\^ input sequences 
are equiprobable) [2,3,7]. 

In particular, in a randomly built relaxed ATd-tree each of the nK possibilities 
of (key,discriminant) pairs are equally likely to appear in the root and once the 
root is fixed, the left and right subtrees are independent randomly built relaxed 
Kd-trees. Thus, we obtain the same distribution for the shapes of randomly 
built relaxed TCd-trees as for randomly built BSTs and randomly built Kd- 
trees. Moreover, given that j keys are in the left subtree, the distribution of 
randomly built relaxed TCd-trees of size n has the recursive property that the 
distribution of left and right subtrees is the distribution of randomly built relaxed 
Aid-trees with sizes j and n — 1 — j, for all 0 < j < n, and the probability that 
the left subtree has size j is simply 1/n, independently of K. 

Parameters like the internal or the external path length do not depend on 
the discriminants [6,7] and hence their expected value is the same for randomly 
built BSTs and randomly built relaxed Aid-trees. 



Theorem 1. [6] The expected internal and external path length of a randomly 

built relaxed Kd-tree is 0{nlogn). 



Therefore, randomly built relaxed Aid-trees perform like randomly built Aid- 
trees for exact match queries. Such an operation explores a path, that, because 
of Theorem 1 is logarithmic in the expected case. 

The algorithms for associative queries in relaxed A'd-trees are similar to 
those for Aid-trees but their expected-case performances are not necessarily the 
same (partial match, for instance), since, for randomly built relaxed A'd-trees a 
path down the tree does not have a cyclic pattern of discriminants but a random 
sequence of discriminants. We now analyze the average cost of performing partial 
match and nearest neighbor queries in relaxed Aid-trees. 

A partial match query is a query in which only s out of the K attributes of a 
given key are specified (with 0 < s < Ai). The retrieval consists of finding all the 
records whose specified key attributes coincide with those of the given query. 

A partial match query in a relaxed Aid-tree explores the tree in the following 
way. At each node it verifies the corresponding discriminant. If it is specified 
(which happens with probability s/K), then it follows recursively only one of 
the subtrees depending on its attribute. Otherwise (with probability 1 — s/K), 
it follows the two subtrees recursively. Hence we have the following theorem. 
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Theorem 2. The expected cost P^, measured as the number of comparisons, of 
a partial match query with s out of K attributes specified in a randomly built 
relaxed Kd-tree with n nodes, is Pn = (3n^ + O (1), where 

i = i(./A-) = -i + ~ 1 - |1 + O (WAf ) , 

. T(2^ + l) 

^ {l-s/K){6+l)P^{5+iy 

with A{x) = V9 ~ 8x and P{x) the Gamma function [5]. 

The proof of this theorem appears in the full version of this paper. The cost is 
obtained by solving a recurrence by means of ordinary generating functions and 
complex singularity analysis. 

The expected cost of partial match queries in randomly built relaxed K di- 
trees is slightly higher than the one given by Flajolet and Puech [3] for randomly 
built i^d-trees. Notice also that the constant (3 in the main order term of the 
expected cost of partial matches in relaxed iFd-trees is independent of the pat- 
tern of specified/imspecified attributes, whereas for standard /Td-trees such a 
constant is dependent on the particular pattern of the query [3] . 

A request for the closest record in a file to a given query record, under a 
determined distance function, is called a nearest neighbor query. More generally, 
given a multidimensional record q, a distance d over D, and a relaxed Aid-tree T 
of n records, an m-nearest neighbor query consists of finding the m points y in T 
such that d(q, y) < d(q, x), for all x but m — 1 of the records in T. The nearest 
neighbor query is the case to = 1. 

Theorem 3. The expected cost (measured as the number of visited nodes) of 
a nearest neighbor query in a randomly built relaxed Kd-tree with n nodes, 
is O (log n) time. 

Sketch of the Proof. The proof is similar to the proof given by Friedman, 
Bentley and Finkel [4]; it is only required to observe that the expected volume of 
a cell in the partition induced by randomly built relaxed ATd-trees is l/(n-|-l).n 

3 Randomized Rd-trees 

Relaxed A'd-trees have been shown to efficiently support dictionary operations 
and associative queries in the expected-case. However, we still have the assump- 
tion that the input sequences of keys are independently drawn from any con- 
tinuous probability distribution. In what follows we drop this requirement via 
randomized algorithms. The randomized algorithms presented here require that 
each node stores the size of the subtree beneath it, because the behavior of our 
randomized operations depends on the size of the subtrees to which they are 
applied. 

We say that a relaxed Aid-tree is a randomized Kd-tree if it is the result of a 
sequence of update operations performed by means of the randomized algorithms 
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introduced below, applied to an initially empty tree. We shall show in this section 
that any tree obtained this way is a randomly built relaxed Aid-tree. 

Informally, in order to produce a randomly built relaxed Aid-tree two proper- 
ties must hold: (1) a new inserted key should have some probability of becoming 
the root of the tree, or the root of one of the subtrees of the root, and so forth 
and (2) every discriminant should have the same probability of being the dis- 
criminant of the new node. 



The insertion of x in a relaxed Aid-tree T, begins by generating uniformly 
a random integer i G {!,... ,k}. This step corresponds to the assignment of a 
discriminant to the new node. If the tree T is the empty tree, then the algorithm 
insert produces a tree with root node (x, i) and empty left and right subtrees (we 
say that a node contains (x, z) if its key is x and its discriminant is i). 



If the tree T is not empty, then, with probability the (key, discriminant) 
pair (x, i) must be placed at the root of the new tree using the insert_at_root algo- 
rithm (since the new tree will have size n-l- 1). Otherwise, we insert the pair (x, i) 
recursively in either the left or the right subtree of T depending on x’s order rela- 
tion with the root of T. Let T have root (y, j), left subtree L and right subtree R. 
Symbolically we have, with probability insert (T, x) = insert_at_root (T, x, z), 
and, with probability 




if 

if 



The algorithm insert requires the possibility of inserting the pair (x,z) at the 
root of any subtree of a relaxed ATd-tree T (using insert_at_root). If T is empty, 
then, insert_at_root (r, X, z) gives as a result a tree with root node x, discrimi- 
nant i and empty left and right subtrees. When T is not empty, by definition, 
insert_at_root (T, X, z) = T' where the root of T' is (x,z), its left subtree consists 
of all the elements of T with z-th dimensional key smaller than and its right 
subtree contains those elements of T with z-th dimensional key greater than 
To obtain the left and right subtrees of T' we use the split algorithm, which we 
present later. 

The deletion of a record from a randomly built relaxed ATd-tree consists of 
searching in the tree the key x to be deleted and then joining its corresponding 
left and right subtrees. Informally, in order to keep a randomly built relaxed 
A'd-tree all the nodes in these subtrees should have some probability of taking 
the place of the deleted node. This is achieved by the Join algorithm (which 
we introduce later). Let x be the record to be deleted from the randomly built 
relaxed ATd-tree T. Let T have root (y,j), left subtree L and right subtree R. 
Symbolically, 
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/ 



delete (J 




■R 



if X 7^ y and 



delete (T, x) = < 



L delete (_R,x) 
Join(T,_R,i) ifx=y. 




if X 7^ y and 



Both, insertions and deletions, consist of two different steps. A first step in which 
one must follow a path in the tree in order to locate the place where the key 
must be inserted or deleted and, a second step in which the update is performed. 

The expected cost of insertions and deletions is C>(logn) time, since both the 
insertion at the root of any element and the deletion of any element occur near 
the leaves on the average. Thus, the reconstruction step, although expensive, 
occurs in subtrees of expected size C>(logn) [ 2 ]. 

The split algorithm receives as parameters a tree T, a key x and a discrimi- 
nant z, and produces two different trees: T<^ = split< (T,x,i) containing all the 
elements of T with i-th attribute smaller than and T>. = split> (T, x, i) 
containing all the elements of T with i-th attribute greater than 

The algorithm split< works in the following way: When T is the empty tree, 
the split< algorithm returns the empty tree. Otherwise, let T have root (y,j), 
left subtree L, and right subtree R. We have four cases to consider: (1) If z = j 
and z/^®^ < x^’’\ then y belongs to T<^, all the elements of L do as well (since 
each z-th dimensional key in L is smaller than z/^®) and thus smaller than 
and the operation proceeds recursively in R to complete the result; (2) If z = y 
and z/^®^ > x*^®\ then the algorithm proceeds recursively in L (since y and all the 
elements in R are greater than x^®^); (3) If z 7 ^ y and z/*-®^ < x^®\ then y belongs 
to r< . and the algorithm proceeds recursively in L and in R and attach to y the 
results of splitting L and R; (4) If z 7 ^ y and z/*^®^ > x^®\ then y is not in the tree 
and the algorithm must proceed recursively in L and R, but now it has to join 
the trees resulting from splitting L and R. Symbolically, if T is not empty. 




if i = y and z/^®) < 



split< {L, X, z) 



if z = y and z/^®) > x^®) 



split< (T, X, z) = < 



split< {L, X, z) split< {R, X, z) 

join (split< (L, X, z) , split< (R, x, z) , y)if z 7^ y and z/^®) > x^®) 




if z 7^ y and z/^®) < x^®) 



The split> operation is defined symmetrically. It is not difficult to see that both 
split< (T, x,z) and split> (T, x,z) compare x against the same keys in T as if we 
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were performing a partial match with one attribute specified. But the additional 
cost of the join algorithm must be taken into account. 

We now describe the algorithm Join (A, j). By definition this algorithm is 

applied only when the j-th attribute values of the tree A are smaller than the j- 
th attribute values of the tree B. Informally, in order to produce a randomly 
built relaxed ifd-tree, each node of A and each node of B must have some 
probability of becoming the root of the new tree. If A and B have sizes n and m 
respectively, then, T = Join {A,B,j) has size n + m. Thus, the Join algorithm 
selects with probability the root of A, (a, ia), as root of the resultant tree T 
and with complementary probability the root,(b, ih), of B. We have the following 
three cases to consider: (1) If A and B are both empty, then, T = Join (A,B,j) 
is the empty tree; (2) If only one of them is empty, then T is equal to the non- 
empty one; (3) If A and B are both non-empty trees with roots (a, ia), (b,ib), 
left subtrees La, Li, and right subtrees Ra and Rt, respectively, then we face 
four subcases. Two subcases correspond to the situation where (a, *„) has been 
chosen as the root of T: (3a) If j = ia, then the left subtree of T is La and its 
right subtree is the result of Joining Ra with B (since every j-th dimensional 
key in B is greater than a^®“^); (3b) If j ^ ia, then the left subtree of T is the 
result of Joining La with split< (B, a, ia). Symmetrically, the right subtree of T is 
the result of Joining Ra with split> {B,a.,ia). The two complementary subcases 
occur when (b,ih) is selected as the root of T. Symbolically, with probability 



And with probability , when (b,ih) is selected as the root of T, we obtain 
symmetrical equations. 

A Join traverses the tree in a similar way than the partial match algorithm 
with one specified attribute, with the additional cost of the split algorithm. 

The randomized split and Join algorithms preserve the randomness of their 
input (because of Lemma 1 below). In other words, when applied to randomly 
built relaxed Lfd-trees, both the split and the Join algorithms produce randomly 
built relaxed iLd-trees. Moreover, since this happens, the insert and delete algo- 
rithms when applied to randomly built relaxed iLd-trees produce randomly built 
relaxed iLd-trees. These claims are made explicit in Theorems 4 and 5 below. 

Lemma 1. Let T be a relaxed Kd-tree with root (y,j) and let T<^ and T>. be 
the relaxed Kd-trees produced by split< (T, x, i) and split> (T, x, z), respectively, 
where x is any key compatible with T. Let T' be the relaxed Kd-tree produced by 
\6\n{A,B,j), where A and B are subtrees of T . IfT is a randomly built relaxed 
Kd-tree, then T<^ and T>. are independent randomly built relaxed Kd-trees. 
Also, if T is randomly built then T' is a randomly built relaxed Kd-tree. 





if i = ia, 



Join {A,B,j) = < 



Join (Ta,split< (B,a, J) ,j) 




Join (i?a,split> (B,a,ia) , j) 
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Sketch of the Proof. The proof is by induction on the size of the tree and can 
be found in the full version of this paper. □ 

Theorem 4. If T is a randomly built relaxed Kd-tree that contains the set of 
keys X and x is any key compatible with X , then insert(T, x) returns the randomly 
built relaxed Kd-tree containing the set of keys X U {x}. 

Proof. The proof is by induction on the size n of T. If n = 0, then T is the empty 
tree (a randomly built relaxed Jfd-tree), and insert(T, x) returns a randomly built 
relaxed /fd-tree with root (x,i), and two empty subtrees, where i is uniformly 
generated from {!,... ,K}. We assume now that T is not empty and that the 
theorem is true for all sizes < n. Consider an item (y,j) S T. The probability 
that (y, j) is the root of T before the insertion of x is since T is a randomly 
built iTd-tree. The probability that (y,j) is at the root after the insertion of x 
is the probability that x is not at the root and (y,j) was at the root, which is 
X resulting in the desired probability. Moreover, if (x,i) is not inserted 
at the root at the first step, the insertion proceeds recursively in either the left 
or the right subtrees of T. By the inductive hypothesis, this result in a randomly 
built relaxed ifd-tree. Finally, with probability (x,i) is inserted at the root 
of T. Since i is the discriminant of x with probability the resulting probability 
is K(n+i) expected, and because of Lemma 1 the two subtrees attached to 
the new root are randomly built relaxed iLd-trees.D 

Theorem 5. If T is a randomly built relaxed Kd-tree that contains the set of 
keys X, then, delete (T,x) produces a randomly built relaxed Kd-tree T' that 
contains the set of keys X\{x}. 

Proof. If the key x is not in T, then the algorithm does not modify the tree, 
which is randomly built. Let us now suppose that x is in T, this case is proved 
by induction on the size of the tree. If n = 1, x is the only key in the tree 
and after deletion we obtain the empty tree which is a randomly built relaxed 
Lfd-tree. We assume that n > 1 and that the theorem is true for all sizes < n. 
If x was not the key at the root of T we proceed recursively either in the left or 
in the right subtrees, and by inductive hypothesis we obtain a randomly built 
subtree. If x was the key at the root of T, the tree after deletion is the result of 
joining the left and right subtrees of T, which produce a randomly built relaxed 
iLd-tree because of Lemma 1. Finally, after deletion, each node has probability 
of being the root. Observe that this is the desired probability since the 
new tree has size (n — !).□ 

Combining these two theorems we obtain the following important corollary. 

Corollary 1. The result of any arbitrary sequence of insertions and deletions, 
starting from an initially empty tree is always a randomly built relaxed Kd-tree. 

It is important to observe that using these new update operations we have that 
the expected running time for insertions, deletions, partial match and nearest 
neighbor queries depends only on the random choices made by the randomized 
update algorithms and not on the sequence of updates. 
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4 Final Remarks 

Throughout this work we have presented a randomized generalization of Kd- 
trees (the randomized ifd-trees) that inherits the simplicity, robustness, flexi- 
bility and efficiency of standard ifd-trees, yet essentially preserves (in terms of 
the number n of records) the expected performance of all the operations (up- 
dates and associative queries) of ifd-trees independently of the order in which 
updates (insertions and deletions) are performed. None of the update operations 
or queries require preprocessing. Finally, randomized iFd-trees are also efficient 
in space (it is 0{n)), requiring slightly more space than standard TFd-trees, as 
the randomized iFd-trees need to store, for each node, its discriminant and the 
size of the subtree beneath it. 
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Abstract. The main contribution of this work is to propose efficient 
randomized leader election protocols in Packet Radio Networks (PRN). 
We show that in a one-channel PRN a leader can be elected among n 
identical stations in O(loglogn) broadcast rounds with probability at 
least 1 — or in O(logn) broadcast rounds with probability at 

least 1 — 0(i). We begin by designing such a protocol for the case where 
the number n of stations is known beforehand. Next, we show that the 
leader election can be completed within the same number of broadcast 
rounds even if n is not known. We show that our protocols are optimal 
in the sense that no randomized protocol that terminates in o(log n) 
rounds can elect a leader with probability higher than 1 — 0(i) and no 
randomized protocol that terminates in o(loglogn) rounds can elect a 
leader with probability higher than 1 — 



1 Introduction 

Let 5 be a set of n radio transceivers in the Packet Radio Network (PRN), 
henceforth referred to as stations. The stations are identical and cannot be dis- 
tinguished by serial or manufacturing number. The problem that we address in 
this work is the well-known leader election problem which asks to designate one 
of the station as leader [2,9]. In other words, after executing the leader elec- 
tion protocol, exactly one station learns that it was elected as leader, while the 
remaining stations learn the identity of the leader elected. 

As customary, the time is assumed to be slotted and all the stations have 
a local clock that keeps (synchronous) time [3]. Also, all broadcast operations 
are performed at time slot boundaries. The stations are assumed to have the 
computing power of a usual laptop computer; in particular, they all run the 
same protocol and can generate random bits that provide “local data” on which 
the stations may perform computations. 

The stations communicate using k communication channels denoted by C(l), 
C(2), . . ., C{k). At any one time, a station can tune in to one such channel and/or 
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broadcast on at most one (possibly the same) channel. A broadcast operation 
involves a data packet whose length is such that the broadcast operation can 
be completed within one time slot. In any given time slot the status of a radio 
channel is: 

— NULL if no station broadcasts on the channel in the current time slot 

— SINGLE if exactly one station broadcasts on the channel in the current time 
slot, and 

— COLLISION if two or more stations broadcast on the channel in the current 
time slot. 

Suppose that the PRN has n stations and that the z-th station, (1 < z < n), 
participates in the protocol for broadcast rounds. Note that the z-th station 
need not broadcast data in all of these rounds. The performance of the protocol 
is evaluated by the following two metrics: 

— broadcast complexity r defined as the maximum number of broadcast rounds 
in which any station participates. In other words, r = max{ri, r 2 , . . . , r„}; 

— work complexity w of the protocol defined as zc = ri + T 2 + • • • + r„. 

Clearly, if all the stations participate in the protocol for the same number of 
broadcast rounds we have r = ri = r 2 = • • • = and w = rn. However, in case 
most of the stations terminate earlier, w ^ rn and these stations are available 
to participate in other protocols. 

Given its importance, the problem of designing efficient protocols for PRN 
has been addressed in the literature [2, 3,4, 5]. However, some of the existing 
protocols are designed under the assumption that the n stations in the PRN have 
been assigned consecutive numbers, typically, from 1 to n. The highly non-trivial 
task of numbering the stations in this fashion is often ignored in the literature. 
The leader election problem is even more fundamental since the consecutive 
numbering problem we just mentioned can be solved by repeatedly selecting a 
leader in a subset of the stations. 

It is customary to address the leader election problem in three different sce- 
narios: 

Scenario 1: The number n of stations in the PRN is known in advance; 
Scenario 2: The number n of stations in the PRN is unknown but an upper 
bound u on this number is known in advance; 

Scenario 3: Neither the number of stations nor an upper bound on this number 
is known in advance. 

It is intuitively clear that the task of leader election is the easiest in Scenario 1 
and the hardest in Scenario 3, with Scenario 2 being in-between the two. In 
this work we shall not deal with broadcast protocols satisfying the conditions of 
Scenario 2. 

Several randomized protocols using a one-channel PRN have been presented 
in the literature. Metcalfe and Boggs [7] presented a leader election protocol 
for Scenario 1 that is guaranteed to terminate in 0(1) expected rounds. Later, 



Randomized O(log log n)-Round Leader Election Protocols 



211 



Willard [9] showed that the leader election can be solved in log log m + 0(1) ex- 
pected rounds if the conditions of Scenario 2 hold. Willard’s protocol involves 
two stages: the first stage, using binary search, guesses in log log u rounds a num- 
ber i, {0 < i < logw), satisfying 2* < n < 2®+^. The second stage finds the leader 
in 0(1) expected rounds using the protocol of [7]. Thus, the protocol finds the 
leader in log log m -I- 0(1) expected rounds. Willard [9] also improved this proto- 
col to run under the conditions of Scenario 3 in log log n + o(log log n) expected 
rounds. The first stage of the improved protocol uses the technique presented 
by Bentley and Yao [1], which finds an integer i satisfying 2® < n < 2*+^ for 
unknown n. However, all the stations executing this protocol terminate at the 
same time, implying that the work complexity of the protocol is O(nloglogn). 

The main contribution of this work is to present randomized protocols to solve 
the leader election problem under the conditions of Scenario 3 on a one-channel 
PRN. We begin by showing that, in case Scenario 1 applies, the 0(1) expected 
round protocol of [7] can elect a leader in O(loglogn) broadcast rounds with 
probability at least 1 — 0(j^^) or in O(logn) broadcast rounds with probability 
at least 1 — O(^). Further, we show that this protocol is optimal in the sense 
that no o(log n)-round (resp. o(log log n)-time) randomized election protocol can 
elect a leader with probability 1 — 0(j^^) (resp. 1 — 0(j^^)). 

We then go on to show that the leader election problem can be solved within 
the same bounds even if the conditions of Scenario 3 are satisfied, that is, if 
neither the number n of stations nor an upper bound on this number is known 
in advance. Specifically, we show that the leader election problem can be solved 
in O (log log n) broadcast rounds with probability at least 1 — 0(j;^) or in 
O(logn) broadcast rounds with probability at least 1 — 0(i), even if no station 
knows n at the beginning of the protocol. Our protocols for the case of Scenario 3 
are also optimal, because the lower bound discussed above holds for this case 
as well. We further show that our leader election protocols have an 0{n) work 
complexity. 

We also provide simulation results to show that our randomized 0(log log n)- 
round leader election protocol performs extremely well in practice. In fact, our 
simulation results show that our protocol features a very small proportionality 
factor, being eminently of practical relevance. 

To put our work in perspective, we note that our leader election protocols 
improve on the result of Willard [9] in three respects: First, our protocol termi- 
nate in O(loglogn) broadcast rounds with high probability, whereas the one in [9] 
terminates in O(loglogn) expected number of rounds. Second, our protocol has 
a work complexity of 0{n), as opposed to that in [9] that requires 0(n log log n) 
work. Third, our protocol is more uniform, more intuitive, easier to understand 
and to reason about. 

2 Leader Election for Known n 

This section reviews basic probability theory results that are useful for analyzing 
the performance of our randomized leader election protocols. For a more detailed 
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discussion of background material we refer the reader to [6,8]. We then review 
the simple leader election protocol of [7] for known n and show that this simple 
protocol that terminates in 0(1) expected broadcast rounds can be also seen to 
terminate in O(logn) or O(loglogn) rounds with high probability. Finally, we 
prove the optimality of this protocol by presenting a tight lower bound. 

Throughout, Pr[A] will denote the probability of event A. For a random 
variable X, E\X] denotes the expected value of X. Let X be a random variable 
denoting the number of successes in n independent Bernoulli trials with param- 
eters p and 1 — p. It is well known that X has a binomial distribution and that 
for every r, (0 < r < n), 

Pr[X = r] = (1) 

Further, the expected value of X is given by 

n 

E[X] = r • Pr[X = r] = np. (2) 

r—0 

To analyze the tail of the binomial distribution, we shall make use of the following 
estimates, commonly referred to as Chernoff bounds: 

Pr[X < (1 - e)£;[X]] < (0 < e < 1). (3) 

Pr[X > (1 -k e)E[X]] < (0 < e < 1). (4) 

We now review the simple randomized election protocol of [7] for the case 

of Scenario 1, that is, when the number n of stations is known in advance. The 

details follow. 



Protocol Election-f or-known-n 
repeat 

each station broadcasts on channel C(l) with probability 
until the status of channel C(l) is SINGLE; 

the station that has broadcast in the last round is declared the leader. 



The correctness of Election-f or-known-n is easily seen. The protocol con- 
tinues until, at some point, only one station broadcasts on the channel. Clearly, 
upon this broadcast, the leader is known by all the participating stations. 

Thus, we now turn to the problem of estimating the number of broadcast 
rounds necessary by the protocol to terminate. Let X be the random variable 
denoting the number of stations broadcasting in a given round. Then, by virtue 
of (I), at the end of this round the status of the channel is SINGLE with prob- 
ability 



Pr[X = I] 





1 



> -. 



e 
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To put it differently, any given broadcast round fails to find a leader with prob- 
ability at most 1 — i. 

This first result allows us to observe that the expected number of broadcast 
rounds required by protocol Election-f or-known-n to terminate is bounded by 




Thus, Election-f or-known-n elects a leader in e expected rounds. 

Now, consider the first t rounds in the protocol Election-f or-known-n. The 
probability that none of the first t rounds produces a leader is 

(Pr[X ^ 1])‘ < ^ • (since 1 -I- a; < e“) 

Consequently, if we take^ t = elnn, the probability that Election-f or-known-n 
will fail to elect a leader at the end of O(logn) broadcast rounds is at most 
Similarly, if we take t = elnlogn, the probability that the protocol fails to 
identify a leader at the end of O(loglogn) broadcast rounds is at most 
Thus, we have proved the following result. 

Lemma 1. The protocol Election-f or-known-n returns a leader in e expected 
broadcast rounds. It also returns a leader in 0(log n) broadcast rounds with prob- 
ability at least 1 — 0 (— ) or in O(loglogn) broadcast rounds with probability at 
least 

Consider a one-channel PRN with only two stations. Consider an arbitrary 
leader election protocol V and let us evaluate the number of broadcast rounds 
needed by V to terminate, that is, to break the symmetry inherent in the problem 
of leader election. In the first round, since the two stations are identical, each of 
them decides whether it should broadcast with the same probability p. Then, to 
break the symmetry, one of them should broadcast while the other should not. 
The probability of this event occurring is 2p{l—p) < Thus, a leader is elected 
in one round with probability at most ^ . If the first round does not lead to the 
election of a leader the stations proceed to the second and, possibly, subsequent 
rounds. Since in any broadcast the data received from the channel is the same 
for the two stations, their broadcasting probability in any round must always 
be the same. Thus, in any given round, a leader is elected with probability at 
most It follows that, with probability at least ( 5 )*, the first t rounds fail to 
elect a leader. Therefore, if the protocol V terminates in o(logn) rounds it must 
must fail to find a leader with probability at least (L)°d°s") > Similarly, 

if V terminates in o(log log n) rounds it must fail to find a leader with probability 
at least Thus, we have proved the following important result. 

Theorem 1. No leader election protocol that terminates in o(logn) rounds cor- 
rectly returns the leader with probability higher than 1 — i. Likewise, no leader 

In this work log and In are used to denote the logarithms to base 2 and e, respectively. 
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election protocol that terminates in o(log log n) rounds correctly returns the leader 
with probability higher than 1 — 



3 Leader Election for Unknown n 

The main goal of this section is to show that the performance of the leader 
election protocol devised in the Section 2 can be attained even if the conditions 
of Scenario 3 hold, that is, even if neither the number n of stations participating 
in the protocol, nor or an upper bound on this number, is known in advance. 



3.1 Electing a Leader in Logarithmic Time 

In this subsection we discuss an 0(log n)-round leader election protocol in the 
case where the number n of stations is not known in advance. Initially, all the 
stations participate in the protocol. In each round some of the stations quit and, 
thus, stop participating in the protocol. This continues until only one station 
remains that will become the leader. 



Protocol Logarithmic-leader-election 
repeat 

each station broadcasts on channel C(l) with probability 
if the status of channel C(l) is COLLISION or SINGLE then 
the stations that did not broadcast quit the protocol; 

endif 

until the status of channel C(l) is SINGLE; 

the station that has broadcast in the last round is declared the leader. 



The correctness of protocol Logarithmic-leader-election being easily 
seen, we now turn to the task of evaluating the number of broadcast rounds 
it takes the protocol to terminate. Let N be the number of remaining stations, 
that is the stations still participating in the protocol at the beginning of a given 
broadcast round. Since the status of the channel C(l) in the previous round 
must have been COLLISION, it is the case that N > 2. 

Let X be the random variable denoting the number of remaining stations 
at the end of the current round. We say that the current round is successful if 
1 < X < [^].In this terminology, the probability that the current round is 
successful is 



Pr[l < X < \^^] > i - Pr[X = 0] = i- ^>i 



(since N >2). 



Since a successful round decreases the number of stations by at least half, 
[logn] successful rounds are sufficient to elect a leader. Our plan is to prove 
that, with probability at least 1 — there are at least [logn] successful rounds 
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among the first 16 [log n] rounds. For this purpose, let V be the random vari- 
able denoting the number of successful rounds among 16[logn] rounds. It is 
important to note that 

E\Y] > 16 [log n] • i > 4[logn]. 



Now, the Chernoff bound in (3) allows us to write for 0 < e < 1 

Pr[r < 4(1 - e)[logn]] < Pr[F < {I - e)E[Y]] < < g-2e^riog"l , 

In particular, for e = | we have, Pr[P < [logn]] < e“s Con- 
sequently, we have proved that with probability at least 1 — the first 16 [logn] 

rounds of the protocol Logarithmic-leader-election contain at least [logn] 
successful rounds. In turn, this shows that with probability at least 1 — the 
protocol will terminate in the first 16 [logn] rounds. To summarize, we have 
proved the following result. 

Theorem 2. With probability at least 1 — ^, the protocol Logarithmic-leader- 
election elects a leader in 16 [logn] rounds. 

Our next goal is to show that the work complexity of Logarithmic-leader- 
election is bounded by 0{n). For this purpose, let N be the number of stations 
remaining at the beginning of the current round, and let X be the random 
variable denoting the number of stations that broadcast in this round. Clearly, 
E[X] = now using the bound in (4) with e = 5, we have, Pr[X > ^] < 
< O (-^). Further, Pr[X = 0] = ^ < O (i^)- Thus, with probability at 
least Pr[l < X < ^] = 1 — Pr[X = 0] — Pr[X > ^] > 1 — 0{^), the number 
of stations is reduced by a factor of at least |. 

Let Nt^ {t> 1), denote the number of stations remaining at the beginning of 
the f-th round and let T be the integer satisfying the double inequality 

TL 

Nt+1 < < Nt- (5) 

logn 

Then, for all t, (1 < t < T), the inequality 



Nt+i < 



m 

4 



( 6 ) 



holds with probability at least 1 — O(j^) > 1 — 

Suppose that for all t, (I < t < T), the inequality (6) holds. Then, Nt < 
(I)* ^ n holds. Hence, the inequality (5) guarantees T G O(loglogn). Thus, the 
probability that (6) holds for all t, {1 < t < T), is at least 1 — ^ 

1 — O(-). Further, in this case, the work complexity of the first T rounds in the 
protocol is 







n = 0{n). 
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We just showed that with probability at least 1 — 0(i), the work complexity 
of the first T rounds is bounded by 0{n) and that at most Nt+i < stations 
remain at the end of the T-th broadcast round. Now, by virtue of Theorem 2, 
after the (T + l)-th round, the leader is elected in 16 [log n] — T rounds with 
probability 1 — 0(i). Hence, in this case, the work complexity of the 16 [log n] — T 
rounds is (16 log n — T)x G 0{n). To summarize our findings, we state the 
following important result. 

Theorem 3. With probability at least 1 — 0{^), the work complexity of the 
protocol Logarithmic-leader-election is bounded by 0{n). 

3.2 Electing a Leader in Doubly-Logarithmic Time 

This subsection is devoted to presenting the details of an 0(loglogn)-round ran- 
domized leader election protocol that works under the conditions of Scenario 3, 
that is, when the number n of stations is not known beforehand. The protocol is 
reminiscent of the one developed in the previous subsection. The main difference 
is that the probability of a station to exit the protocol varies in each round, and 
is determined by a variable to, as we are about to specify. The details follow. 



Protocol Doubly-logarithmic-leader-election 
each station sets m <— 0; 

repeat 

each station broadcasts on channel C(l) with probability 
if the status of channel C(l) is NULL then 
each station sets to <— max{m — 1, 0} 

endif 

if the status of channel C(l) is COLLISION then 

all stations that did not broadcast in this round quit the protocol; 
all the remaining stations set to <— to -I- 1; 

endif 

until the status of channel C(I) is SINGLE; 

the station that has broadcast in the last round is declared the leader. 



Clearly, upon termination, the protocol Doubly-logarithmic-leader- 
election correctly returns a leader. To assess the expected behavior of the 
protocol, we now estimate the number of rounds necessary for the protocol to 
terminate. Suppose that at the beginning of a given round there are N stations 
remaining and that the probability that a station broadcasts is p. We shall as- 
sume, for the sake of simplicity, that the number of stations broadcasting during 
this round is exactly the expected number of stations to broadcast. In other 
words, we assume that in each round 



out of the N remaining stations, exactly Np stations broadcast. (7) 
Let T be the integer satisfying 



2 ^ . 2 ^ - 2 ^ • • • 2 ^ 



< n < 2^ -2^ 



( 8 ) 
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It is easy to see that (8) can also be written as: 2^^ < n < 

confirming that T € 6>(loglogn). 

For t > 1, let denote the number of stations remaining at the beginning 
of the t-th round of the protocol. Assuming that n and T are large enough, we 
will evaluate each Nt- Assumption (7) guarantees that in the first round out 
of n stations ^ = j stations broadcast. Thus N 2 = stations remain at the 
beginning of the second round. 

In the second round m = I and by assumption (7) ^ stations 

broadcast. Thus stations remain at the beginning of the third 

round. 

Continuing in this way, Nt = 2 ^^ - 2 ^'^ — ^ 2 t-a stations remain at the begin- 

ning of the t-th round, (I < t < T), and = ^^0 ^ 21 . 222 . ,. 22 t-i stations broad- 
cast. By (8), in the T-th round m = T - 1 and = ^ 20 . 221 . 2 "l.. 22 :t-i < 1 

stations broadcast, and the status of C(I) is NULL or SINGLE. If exactly one 
station broadcasts, the protocol terminates and a leader is elected. Otherwise, 
the status of channel C(l) is NULL and no station quits. Further, m is decreased 
to T — 2, and 

Tl 

Nt +1 = Nt = 22 “ . 22^ . 22^ . . . 22^“^ ’ 

Now (8) and (9) combined allow us to write 



N 



T+l < 



22 “ . 22 ^ . 2 ^^ . . . 



< 2^ 



( 10 ) 



In the (T -|- l)-th round, m = T — 2 and, by assumption (7), stations 

broadcast. If the status of channel C(l) is NULL, that is, if Nt+i < 2^ , 

then m is decreased by one. If the status of channel C(l) is SINGLE, a leader is 
elected and the protocol terminates. Therefore, we may assume that the status 
of channel C(l) is COLLISION. In this case, Nt +2 = ^t -2 stations remain. 
Now, (10) guarantees that 



Nt+2 



Nt+i 



< 2^ 



( 11 ) 



In the (T -|- 2)-th round m = T — 1 and, by assumption (7), stations 

broadcast. Note that equation (11) guarantees that the status of channel C(l) 
must be NULL, no station quits in this round, and Nt+s = Nt+ 2 - 

In the (T -|- 3)-th round m = T — 2 and, by assumption (7), stations 

broadcast. If (11) holds with equality, then exactly one station broadcasts and 
the protocol terminates. Otherwise, the status of the channel must be NULL, m 
is again decreased by one, and Nt +4 = Nt+ 2 . 

In the (T -|- 4)-th round, m = T — 3, and by assumption (7), stations 

broadcast. Note that by (11) it must be the case that 



Nt+5 



Nt+2 / ,.,2^-3 

22^-3 ^ 



( 12 ) 
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Now, (11) and (12) combined imply that three rounds, namely, T + 2, T + 3 
and T + 4, have reduced the number of stations from 2^^ ^ to 2^^ ^ . This argu- 
ment shows that 0{T) rounds suffice to elect a leader. Since T S 0(loglogn), 
we have the following result. 

Lemma 2. Under the assumption that in each round the expected number of 
stations broadcast, the protocol Doubly-Logarithmic-leader-election elects 
a leader among n stations in O(loglogn) rounds. 

Since each station decides to broadcast independently, assumption (7) is not 
entirely realistic. As it turns out, a formal proof of the fact that the protocol 
terminates in O (log log n) rounds without this assumption is much more difficult 
and, due to stringent page limitations, will only appear in the journal version of 
this paper. Nonetheless, we state this important result here. 

Theorem 4. The protocol Doubly-Logarithmic-leader-election elects a 
leader among n stations in O(loglogn) rounds with probability at 
least 1 — 0(i-^). 

Theorem 5. The work complexity of the protocol Doubly-Logarithmic- 
leader-election is 0{n) with probability at least 1 — 0{^). 

Although Theorem 4 guarantees that Doubly-logarithmic-leader- 
election has a very appealing “doubly-logarithmic” performance, it is certainly 
of interest to see how the protocol performs in practice. We refer the reader to 
Table 1 where a number of simulation results are summarized. These results 
show that our protocol has an excellent practical performance, featuring a low 
proportionality constant. The protocol was simulated 1000 times for each num- 
ber of stations in the first row of Table 1. The second row features the average 
number of rounds needed by the protocol to elect a leader. It is very encourag- 
ing to note that even in the case of 100 million stations, the average number of 
rounds required by our protocol to elect a leader is only 10.7. 



Table 1. Simulation of Doubly-logarithmic-election 



7 ^ stations 


10^ 


10-^ 


10^ 


10'‘ 


lO^’ 


10*= 


10' 


10^ 


# rounds 


3.8 


5.4 


7.2 


7.9 


8.6 


9.6 


10.1 


10.7 
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Abstract. Let G be a given graph (modeling a communication network) 
which we assume suffers from static edge faults: That is we let each edge 
of G be present independently with probability p (or absent with fault 
probability f — 1—p). In particular we are interested in robustness results 
for the case that the graph G itself is a random member of the class of 
all regular graphs with given degree d. 

Here we deal with expansion properties of faulty random regular graphs 
and show: For d > 42, fixed and p — K,/d ,k > 20, a random regular 
graph with fault probability f = 1 — p contains a linear-sized subgraph 
which is an expander almost surely. This subgraph can be found by a 
simple linear-time algorithm. 



Introduction 

Modern multiprocessor architectures and communication networks compute over 
structured interconnection graphs like meshes. Here several applications share 
the same network while executing concurrently. This may of course lead to un- 
availability of links and nodes in certain cases and we may assume to compute 
over a subnetwork being randomly assigned by the operating system. Moreover, 
this subnetwork may suffer from edge or node faults. Our work addresses ro- 
bustness properties in case the subnetwork is a random regular graph suffering 
from edge faults. Random regular graphs make an (at least theoretically) popu- 
lar choice because they combine low degree with high expansion almost always 
(see [AlSp 92] for an introduction to expansion and [Bo 85] to random regu- 
lar graphs). In case of structured networks like the butterfly or the hypercube 
it is known how to simulate the non-faulty network on the faulty one with a 
well-determined slowdown, for example [Le et al. 92] ,[Ta 92] . In [Ka et al. 94] 
random graph concepts and techniques are applied to the butterfly. 

Our study continues the work begun in [Ni et al. 94], [NiSp 95, Go 97] which 
are the only papers known to us which investigate random regular graphs with 
edge faults. The paper [Ni et al. 97] contains an application of results on faulty 
random regular graphs to fat trees, a class of interconnection networks with uni- 
versal simulation properties. In [Go 97] we show that p = l/d — I is a threshold 
probability for the existence of a linear- sized component in faulty random regu- 
lar graph. Note that a linear- sized component is an absolute necessity in order 
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to simulate the non-faulty network on the faulty one with only constant slow- 
down (Slowdown is determined as the fraction of the time of the simulation on 
the faulty network and the time needed on the non-faulty network.) But, to 
achieve only constant slowdown, we need more than just a linear-sized compo- 
nent: We need (at least) a linear-sized subgraph which preserves the expansion 
properties (cf. fact 1) which are crucial for efficient communication in non-faulty 
random regular graphs. Recently, see [Ku 97], processing networks with faulty 
edges have been investigated from the point of view of message routing. In this 
context faultiness corresponds to congestion of an edge. 

The interesting paper [NiSp 95] shows that the first eigenvalue of the (or ”a” or 
’’any”, the uniqueness is not really known) linear-sized component of a faulty 
random regular graph is bounded away from the average degree. This implies 
good expansion properties, but does not directly give us an expanding subgraph, 
necessary for the above mentioned simulation. In section I we show, that the 
linear-sized component itself is no expander. (Note: component is a maximal 
connected subgraph.) This motivates our work to actually find a linear-sized ex- 
panding subgraph. In section 2 we present a simple edge deletion process which 
will give us a linear-sized subgraph which is an expander with high probability if 
the fault probability is not too high. Our edge deletion process iteratively deletes 
all edges incident with nodes of degree < 2. It thus finds the 3-core (=unique 
maximal subgraph where each node has degree > 3) of the faulty graph. Due to 
its randomness properties this core is an expander with high probability. In the 
subsequent sections 3,4 a probabilistic analysis of this algorithm is performed. It 
is inspired by the analysis in [BrFrlJp 93] of a simple algorithm for 3-satisfiability. 
The tight analysis of an edge deletion process similar to ours for random graphs 
without degree bound [PiSpWo 96] cannot directly be transferred to the present 
situation because the crucial differential equations (5.7) need to be modified 
somehow to reflect the degree bound d. 

In view of [PiSpWo 96] the intuition of our analysis is: In general random graphs 
with edge probability c/n, c a constant, the degree of a given node is approxi- 
mated by the Poisson distribution with mean c.When c is large enough we get a 
linear-sized 3-core. In the case of faulty random regular graphs we get the bino- 
mial distribution with parameters d, p = n/d. for the degree of a given node. 
The Poisson distribution is the limit of this binomial distribution when d gets 
large. Full proofs can be found in [Go 98]. 

1 Random Regular Graphs and Configurations 

The probability space of random regular graphs with edge faults is given by 
the following probabilistic generation procedure: 1. Choose a d-regular graph 
G = {V, E) where V = {ui, . . . ,u„} according to the uniform distribution. To 
ensure the existence of such graphs we will always assume that d ■ n is even. 2. 
Delete randomly K edges from G. Each set of K edges is equally likely to be 
deleted. For the rest of this paper we make the following notational conventions: 
The degree d of our graphs is fixed. The number of nodes is n. We assume that n 
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gets large, and is such that nd is even. We let N = nd/2. The number of non- 
faulty edges is L = n/d ■ dn/2 = hl ■ n/2, where k is a constant independent 
of d. We always assume d > k. Then K = N — L is the number of faulty edges. 
The standard tool to deal with random regular graphs are random configura- 
tions ([Bo 85], p. 47ff). Random regular graphs with edge faults are dealt with 
by random configurations with edge faults [Ni et al. 94]: Here we have n dis- 
joint sets Wi, . . . ,Wn where each Wi consists of d elements. We let always 
W = Wi U . . . U Wm the Wi are called classes, the elements of the Wi 
are nodes, sets {v, w} where v, w G W, v ^ w are called edges. A config- 
uration is a partition of W into N = dn/2 edges. We have {2N — 1)!! = 
(2N — 1) • (27V — 3) • • • 3 • 1 = (27V)!/(7V! • 2^) configurations. A random con- 
figuration with edge faults is given by the following probabilistic procedure: 1. 
Choose a configuration F from the uniform distribution. 2. Delete randomly a 
set A C r of K edges from F. We represent this procedure as a probability 
tree where each leaf has the probability 1/(27V— 1)!! • We mark each leaf 

of the tree with the corresponding set F of L = N — K non-faulty edges. The 
probability of <P is the sum of the probabilities of the leaves representing F. We 
call this probability space Con(L, n). To show that a property holds for almost 
all faulty graphs, i. e. simply with probability tending to 1, we can show the 
analogous result for faulty configurations (cf. [Bo 85], p. 47 ff). 

We use the following notions of high probability: ’’Almost surely” or ’’with 
high probability” or ’’almost all” means that the probability goes to 1 when n 
goes to infinity. ’’Quite surely” means, for any constant a > 0 the probability 
is > 1 — 0(l/n“) for all sufficiently large n. 

A slightly different notion of faulty random regular graphs and configurations 
is considered in [Go 97, Ni et al. 94, NiSp 95], where step 2 of our probabilis- 
tic generation procedure is replaced by:2. Delete each edge independently with 
probability 1 — n/d. These models are essentially equivalent (cf. the analogous 
situation for random graphs [Bo 85], p.33). The following fact is from [Bo 88] 
(part(a)) and from [Go 97] (part(b)). 

Fact 1 (a) There is a constant (independent of d and n) c > 0 such that for a 
random configuration F the following holds with high probability: For all sub- 
sets X of classes we have ] JV{X) \> c ■ Min{] A j, j Cpl X ]}, where Af{X) is 
the set of classes adjacent to X in F but not belonging to X. Hence, random 
configurations and random regular graphs are (c-)expanders, with high proba- 
bility. The maximal such c is called the expansion constant of F. 

(b) If p = n/d > \/{d — 1) then the following holds in a faulty random regular 
graph (proved in the independent edge deletion model above): There is a con- 
stant e = £(k), such that almost surely a faulty configuration has a connected 
component having at least e • n classes. 

Theorem 2 shows that there exists a linear-sized component which as a whole 
is no expander. Note that we do not know yet that we have only 1 linear-sized 
component. (However, this is likely by random graph experi- 
ence [Ka et al. 94, AlSp 92]). 
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Theorem 2. Let k be an arbitrary constant. With high probability a faulty 
random configuration where 1 > p = n/d > \/{d— 1 ) has a linear- sized 
component, whose expansion constant is < 1 /fc 

Proof: We employ the ’’double randomization trick”, e. g. [Go 97]. Let e > 0 
be small enough such that p'=p — e=^> Let 

p" = 1 — — ^ — - — > 0 then (1 — p') ■ (1 — p") = 1— p (note p" < 1 ). 

1 -p-l- £ 

Hence, when we modify step 2 of our generation procedure such that we first 
throw the edges (of the underlying random configuration) with p' and then 
with p" we get a faulty random configuration with p. Our proof follows these 2 
probabilistic experiments and consists of 2 steps: 

Step 1 : After throwing with probability p' we have with high probability a linear- 
sized component C and a linear number of isolated paths each of length k. We 
denote by V the collection of these paths. Step 2: After throwing the edges 
with p” we have with high probability at least 1 path P G V connected with C 
via a path consisting of edges thrown in by experiment 2. Moreover, P may be 
considered as a fringe of C: The path P has only 1 neighbour reachable via a 
non faulty edge. 

Step 1 is shown by running sufficiently many (breadth-first) searches on disjoint 
parts of a faulty configuration. With probability bounded away from 0 (when n 
gets large) an isolated path of length k is found by a single search. Tail bounds 
imply the almost sure existence of a linear number of such paths. We formalize 
our search algorithms as (breadth-first) generation algorithms as in [Go 97]. Our 
algorithm directing the single searches uses the following global variables. These 
are implicitly updated each time they change: E = the set of non-faulty edges 
generated. = the set of faulty edges generated. Free = the set of nodes which 
have not been looked at, that is which do not occur in E U ^E. ^ Free = the set 
of nodes which have already been looked at. Dis = the set of classes which are 
incident with faulty or non-faulty edges generated by previous searches. Path = 
the set of classes discovered via non-faulty edges by the current search. We fix a 
constant 7 , sufficiently small. The single searches are called from the procedure 
Gen: 

Procedure Gen 

1. For i = 1 to 7 • n/A: do 

2. Pick a class S ^ Dis deterministically; Path := 0 

3. Search(S'); Dis := Dis U Path 

The variable Path in the subsequent procedure Search(S') is used as follows: Path 
will always contain what can be looked at as part of an isolated path consisting 
of k classes. If Path contains a different structure, the current Search(S') stops 
with an error. The interior classes of this path contain no node from Free. Hence 
these classes are incident with sufficiently many faulty edges in order to make 
Path isolated. A class added as new end class to Path will contain only nodes 
from Free. This is important to bound the error-probability of Search(S') away 
from 1 . 
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Procedure Search(S') 

1. Path := {S'} 

2. while |Path| < k and Path has a class T at an end with \T n Free| > 0 

3. Pick such a class T deterministically. 

4. while \T n Free] > 0 do 

5. Pick a node a; G Tn Free deterministically. 

6. Choose y G Free\{a;} from the uniform distribution. 

Add the edge {x, y) to E with probability p\ to ~^E with 1 — p' . 

7. if (x, y) G E then 

Add the class of y, we call it R, to Path. Return 

with error, if |i? H Free| < d — 1 or path cannot any 

longer be extended to an isolated path of k classes due to R. 

8. if (x, y) G ~^E and this implies that Path 

cannot be extended as required, then return with error. 

A successful return of Search(S) occurs, when the while-loop from 2. is left 
regularly. This happens for a given call of Search(S) inside Gen with proba- 
bility bounded away from 0. To this end it is helpful to visualize each com- 
putation of Gen as a path in a probability tree. Conditioning on an arbitrary 
history El of Gen which ends just before a new execution of Search(S) we get that 
Prob[Search(S) ends without an error ] > e{d, k' k) > 0. Hence the expected 
number of successful returns from Search(S) is at least (7 -nfk) ■ e which is linear 
in n (but decreasing in k and d) . Tail bounds for the binomial distribution imply 
that we get quite surely a linear (in n) number of isolated paths in our set P and 
step 1 is finished. We show step 2: We condition everything what follows on the al- 
most sure event that the underlying configuration is an expander. In the underly- 
ing non-faulty configuration we define Cq = C and Q+i = N{Ci) (cf. fact 1). 
The expansion property implies that there is a j depending on \C\ and \V\ the 
number of classes in C and V, such that Cj and V have > 1/2 • |7^| many classes 
in common. This is simply because Cj has n- (1 — (1/2)|7^|) many classes . As j 
is constant, we have a linear in n number of classes W in V over which V is 
entered. That is we have a path from C to W which does not hit V before. 
As k is fixed, we have a linear number of paths in 7^, Pi, ... , Pq,„ which contain 
such an entering class W. Throwing in the edges of the underlying configuration 
with p" gives us a linear number of Pi connected to C in the required way (with 
high probability) using the ’’entering” paths. 



2 The Edge Deletion Process 



For a node x G W, Deg<2>(a;) =Deg^(W), the number of nodes in the class W 
incident with non-faulty edges and W is the class to which x belongs. The 3- 
core of E is the maximal subconfiguration such that Deg,f / (x) > 3 for each x 
from <P' . The following edge deletion process iteratively strips away the edges 
incident with classes of degree < 2. 
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Input: A faulty configuration <P € Con(L, n). 

Output: The 3-core of <P. 

while ^ has classes of degree < 2 do 

^ := <P\{{x, y}|Deg< 2 ,(a;) < 2 or Deg^(y) < 2} 
od; return <P as the 3-core. 

For our probabilistic analysis of this algorithm we use the probability spaces 
Con(m, n) = Con(m, uq, U 2 , where n = uq + ni + U 2 + n^, which 
contains as atoms the faulty configurations with exactly: m edges, n classes of 
which are: no of degree 0, ni of degree 1, U 2 of degree 2, and ns of degree > 3. 
Each configuration is equally likely. When considering Con(m, n) we use the 
following names and abbreviations: A class is heavy in € Con(m, n) if it is of 
degree > 3 in it is light if its degree is 1 or 2. A node is heavy or light, iff it 
belongs to a heavy or light class. We let 

2m — I h 

I = ni + 2 u 2, n = 2m — t, and A = = — . 

ns ns 

Hence, I is the number of light nodes, h the number of heavy nodes and A is 
the average degree of a class W given that W is heavy. Note d > A > 3. 
When we run our edge deletion process with <P G Con(m, n) we let <l>i be the 
configuration obtained from <!> after i rounds of the loop (<?o = ^)- Moreover 
we have m^, rii, no,i, ns^i, h and Xi for i > 0 as random variables on 

Con(m, n). For example = no{<Pi) = the number of classes of degree 0 

in <I>i. As usual when a probabilistic analysis is possible, the (uniform) distribu- 
tion needs to be preserved. 

Lemma 1. (a) We consider Con(7Ti,h) and Con(m',h'). For 0, 0' e Con(m',h') 
we have: |{<? G Con(m,h)|^i = 0}| = |{<? G Con(m,h)|<?i = 0'}|, that is each 
configuration from Con(m', n!) is hit by the same number of configurations from 
Con(m, n) after 1 round of edge deletion. 

(b) For 0, T G Con(m, n) we have for a random G Con(L, n) and i > 0 fixed: 
Prob[<?i = 0] = Prob[<?i = F], Hence, we have in the space Con(L, n): Condi- 
tional on the event that <l>i G Con(m,h), each configuration from Con(m,n) is 
equally likely. 

We need to determine the probability distribution of the degree of a fixed class 
of degree > 3 in Con(m,h). In general, i. e. when the degree is unbounded, we 
have the concept of Poissonization in this situation [BrFrlJp 93, PiSpWo 96]. 
The present situation leads to the concept of ’’binomialization”: 

Lemma 2. We consider the probability space Con(m, h), conditional on the 
event that W \ , . . . , Wn^ are the actual heavy classes. Let 0 < p < 1 and Y = 
Y{p) = [Bin((i, p)|Bin(d, p) > 3], where = means equality in distribution and 
Bin(d, p) is the binomial distribution with parameters d and p. Let {Xi , . . . , Xn^) 
be the random vector of the degrees of our heavy classes Wi , . . . , ■ 

(a)Let Yi, , Yn^ be i.i.d. with Yi = Y. Then 

(A"i, . . . , Xns) = [(Fi, . . ■ , Fna)! = h]. The right-hand side of the equation 
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means (Yi,...,y„ 3 ) conditional on the event that '^Yi = h, the number of 
heavy nodes. 

(b) The function EY = {EY){p) = E[Y{p)] is strictly increasing for 0 < p < 1. 
For A with 3 < A < c? we have a unique p such that 
{EY){p) = X = E[X,\. 



Some more abbreviations: When considering Con(m, n) where S < X < d we use 
the following parameters: p € (0, 1] is uniquely given by: 

A = E[Bin(d, p)|Bin(d, p) > 3]. Moreover A = p-d. Hence p causes the Y^ to have 
the right expectation. Note, 0 < A < d for p e (0, 1] and A > A for p e (0, 1) but 
X = X = d for p = 1. As usual in the present context Aq, Ai, . . . are considered 
as random variables on Con(m,h), the index indicating the number of rounds 
of edge deletion (of course Aq is constant, but for i > 0 the A^’s depend on the 
actual ^). 

Concerning the probability distribution of the degree of a fixed heavy class W 
we have the following statement which follows from the definition of the Y^ : 



Prob[Deg^(FF) = k] 



Prob[Yj^ = fc] • Prob[Y2 + • • • + Y^g = h — k\ 

Prob[Yi + . . . + Y„3 = h] 



The local limit theorem for sums of lattice-type random variables [Du 91, Fe 71], 
allows us to estimate the fraction Prob[Y 2 -l-- • --l-Y „3 = h — fc])/(Prob[Yj^ -I- • • • + 
Y „3 = hj), as 1 -I- o(l) (for > oo) provided the p from lemma 4(b) is chosen 
as p. In this case we have h = E[^ Yj], see also [BrFrlJp 93], [PiSpWo 96] for 
a similar application of the local limit theorem. 

Corollary 1. We consider the probability space Con(m, h) conditional on the 
event that the class W is heavy. We let go to infinity and assume that A is 
bounded away from 3 and d. Then we have the following estimate: 

Prob|Deg,W = t] < 

where B(p,S) is the probability that the binomial random variable with param- 
eters d and p is > 3. 



3 One Round of Edge Deletion 



We use some more abbreviations: 



p = 



B(p,3) 



M = 



1 - 



I 

2m 



uj = V ■ p. 



Note that the values depend on the actual probability space Con(m, h) con- 
sidered. We consider po, ^7ij • ■ ■ j MOj Mij • ■ ■ as random variables on Con(m,h), 
the index indicating the number of rounds of edge deletion. Moreover, 6 with 
1 > i5 > ^ is fixed. Of particular importance is ly, the proportion of light nodes 
among all nodes still under consideration. 
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To calculate probabilities in Con(?ri,n) it is helpful to consider each Con(m, fi) 
as generated by the following string generation procedure. Note that each config- 
uration is actually generated m!-2™-times: 1. Choose 2m nodes from W according 
to the degree constraints given by n. Each single possibility is equally likely. 2. 
Choose a random permutation of the nodes chosen in 1. We visualize this as 
putting the nodes chosen in 1. randomly into 2m slots: first slot 1, then slot 2, 
. . ., slot 2i — 1, 2i represent 1 edge. Each possibility has the probability j 2 ^.- 
The following lemma 6 follows by analyzing the above generation procedure, the 
sharp concentration result follows from tail estimates for martingales (Azuma’s 
inequality), [AlSp 92], chapter 7. See [BrFrUp 93] for the application of Azuma’s 
inequality in a similar context. 

Lemma 3. Let e,e' > 0. We consider the probability spaces Con (m,h) where 
0<e</5<l — £<1. (This restriction implies that A is bounded away from 0 
and d, and therefore 77 and ^ can be bounded above by a constant.) We assume 
that n^> e' ■ n. After 1 round of edge deletion we have: 

(a) E[mi] = m(l - + 0(^)). 

(b) E[no,i -l-no] < n^rj ■ Y,k >3 m))- (Note that no,i - ^ - no is the 

number of heavy classes which get degree 0 after 1 round of edge deletion.) 

(c) E[m,i] < ■ J2k>2 + 0{^)) 

(d) E[n2,i] < ’ Efe>i + 0(^)) 

(e) E[Zi] < E[ni_i] -|- 2E[n2,i] 

Moreover, for all these random variables we have the same sharp concentration 
result as for mi: Quite surely (in Con(m,h)) jmi — E[mi]j < n^. 

First we bring lemma 6, in particular the sharp concentration part, into a more 
manageable form. To this end we fix S' with S' > S and assume that the number 
of light nodes I is not to small, i. e. I > . This implies that we have for the 

deviation from the sharp concentration result that = o{l), making the 
following estimates possible: 

Theorem 3. We consider Con(m, n) and make 1 round of edge deletion. We 
let £ > 0 be fixed and assume = 713^0 ^ ' n, hence m > e ■ n. We make the 

following additional assumptions about Con(m,h): d/2 > A > 16, 1 / < 1/10, 
w < 3/4, I > . We have quite surely (with respect to Con(m, h)) after 1 round 

of edge deletion: 

(a) mi = m • (1 — i/)^ -I- 0{n^). (b) no.i — I — no < ■ n. 

(c) ni,i < no ■ ^ - u. (d) ri2,i < n^ ■ ^ ■ u. 
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(fi) « 3 ,i ■ {1 — ^ ■ i') . (f) i^i < ^ ■ ly (Then li = I'l ■ 2m < ■ 2m = ^ • Z. It 

is important that the proportion of light vertices decreases geometrically.) 

(g) Aq > Ai > Ao ■ (1 — 3v). (Ao = the A of Con(TO, h).) (h) ui < u. 

Before applying theorem 7 iteratively we look at the beginning applying Azuma’s 
inequality again: 

Theorem 4. We consider the probability space Con(L,n). Let r = k/cZ be the 
probability that a given edge is present. Quite surely we get: 



(a) no < n-exp(— k). (b) ni < n-exp(— k) • (c) U 2 < n-exp(— k) • A 



4 Several Rounds of Edge Deletion 

Our edge deletion process is analyzed by an iterative application of lemma 6 
and theorem 7. Some intuitive explanation: At first one would possibly try to 
apply only lemma 6 iteratively: the expectation of the number of light vertices 
decreases geometrically which should give an expectation of 0 light vertices after 
logarithmically many rounds of edge deletion. Then Markov’s inequality implies 
the non-existence of light vertices with high probability. But, v is constant ini- 
tially and lemma 6(a) does not exclude the possibility of ending up with the 
empty 3-core after logarithmically many rounds. Moreover the iterated appli- 
cability of the lemma cannot be ensured by considering the expectation alone. 
Therefore we need theorem 7, in particular (f), (g). The other points are neces- 
sary to ensure the iterated applicability (quite surely) of the theorem. We use 
theorem 7 as long as I > . Afterwards we simply use lemma 6 as indicated 

above. Note that then v = o(l), as m is still linear in n . 

Theorem 5. We let k > 20 (then the number of edges is L > 10 • n) and d > 42 
fixed. We consider the probability space Con(L,n): 

(a) Quite surely in Con(L,n) we have: 21 > A > 19, < 1/100, u < 1/2. 

(b) Let j < n, conditional on the event . . . ,lj-i > we have quite 




(d) I < n ■ exp(— k) if k > 1. (e) ri 3 > n • (1 — 3 exp(— k)) if k > 1. 



surely: 



(ii) uij < 

(iii) \> Xj > X- rifcd(l - (iv) ri3,j > n3,o • rifcc)(l “ 



(c) There is a j = 0(log(n)) such that we have quite surely: There exists a 
k < j such that h ^ ■ 
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Note that (b) (i) and (iv) of theorem 9 imply that the heavy nodes do not 
disappear. Finally the iterative application of the expectation part of lemma 6 
is given by: 

Theorem 6. We consider Con(TO,fi) where Z < | > A > 17 and ri 3 > e-n 

for a fixed e > 0. Let 0 < j < 0(log(n)). Conditional on the event that 
I = lo,h, ■ ■ ■ , Ij-i < , we have: 

(a) E[/j] < (^)'^ ■ I- (b) Ij < n^' quite surely. 

(c) ri 3 j > ri 3 fi — 0{j ■ ). (d) rrij > toq ~ 0{j ■ \ (e) | > Xj > 16. 

We now combine theorem 9 and 10 to get the final result: With k > 20 and 
d > 42 the edge deletion process applied to a random member of Con(L, n) 
yields with high probability a 3-core of size > £ • n, where £ is independent of d. 
The techniques from [Bo 88] show, that this 3-core is with high probability an 
expander with expansion constant > ^ • 3/d • 2 where ^ is a not unrealistically 
small constant < 1. 
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Abstract. Simon showed a problem that can be solved in polynomial 
time on a quantum Turing machine bnt cannot be solved in polynomial 
time on any probabilistic Turing machine. However, his quantum algo- 
rithm is evaluated in expected time. In this paper, we show a polynomial 
time quantum algorithm in worst case for Simon’s problem. Brassard and 
Hpyer also construct a polynomial time quantum algorithm in worst case 
for it. However, our algorithm is much simpler than their algorithm. 



1 Introduction 

Feynman pointed out that current computers may not be able to efficiently sim- 
ulate some quantum physical phenomena because the computational principle of 
current computers is based on classical physics [7]. Furthermore, he also pointed 
out that in order to efficiently simulate any physical phenomenon, we will need 
a new-type computer based on quantum physics, a quantum computer. After 
that, as a model of the quantum computer, Deutsch proposed quantum Turing 
machines (QTMs) [5], and Bernstein and Vazirani mathematically formalized 
the QTMs [1,2]. 

The first indication that a QTM may be more powerful than current com- 
puters is Deutsch and Jozsa’s result in [6]. They showed a problem that can 
be solved in polynomial time on a QTM but can not be solved in polynomial 
time on any deterministic Turing machine (DTM). However, this problem can 
also be solved in polynomial time on a probabilistic Turing machine (PTM). 
In 1994, for the first time, Simon showed that QTMs are more powerful than 
current computers, i.e., he showed a problem that can be solved in polynomial 
time on a QTM but can not be solved in polynomial time on any PTM [12,13]. 
His problem is as follows: 
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Let f : {0,1}" — > {0,1}'" with n < m he a polynomial time eomputable 

funetion such that for any x ^ x' £ {0, 1}" and a nonzero s € {0, 1}", 

f{x) = f{x) iff X =x®s. 

Then, find such a nonzero s. 

However, since his quantum polynomial time algorithm is evaluated in ex- 
pected time, we do not satisfy a little. Therefore, we have investigated whether 
there exists a quantum polynomial time algorithm in worst case for Simon’s 
problem. In this paper, we show a polynomial time algorithm in worst case for 
it. Brassard and Hpyer also construct a polynomial time algorithm in worst case 
for it, however, their algorithm is more complicated than our algorithm [4]. This 
is because that their algorithm is combined with Simon’s algorithm and Grover’s 
database search algorithm [8] . On the other hand, our algorithm consists of two 
simple routines (Lemma 1 and Lemma 2) and does not use any probabilistic 
argument. 

2 Quantum Computation 

In this section, we briefly review a quantum computation. For more detail, we 
refer the reader to [1,2,5]. Throughout this paper, let C and Z be the set of 
complex numbers and the set of integers, respectively. 

Like classical Turing machines, a QTM M also consists of a finite control, an 
infinite tape, and a tape head. Let Q be the finite set of states and S the finite 
set of tape symbols. A configuration of M is defined by a triple (p, h, a), where 
p a Q denotes a current state, h G Z denotes an index of the tape cell over which 
the tape head is currently located, and a G E* denotes a tape content in the 
tape. Then, the state transition function S of M is 

6 : Q X E X E X Q X {L, R} ^ C. 

This transition function 5{p,a,b,q,d) = Am means that if M reads a symbol a 
in a state p, it 

1. writes a symbol b on the cell under the tape head, 

2. changes the state into q, and 

3. moves the head one cell in the direction of d G {L, R}. 

Further, let ci be a configuration before a transition and C 2 a configuration after 
the transition. Then, the value is a transition probability that M changes 

its configuration from c\ to C 2 (the value Am is called an amplitude). Thus, the 
QTM differs from a classical one in the sense that it evolves on a superposition of 
configurations and we call a computation using this property a quantum parallel 
computation. 

Next, we denote that the state transition function 6 corresponds to a time 
evolution matrix (a physical system) Us of the QTM M, i.e., each row and column 
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of Us corresponds to a configuration of M. Let ci and C 2 be two configurations 
of M. Then, the element corresponding to C 2 row and Ci column of Us is the 
value of 5 evaluated at the tuple that transforms ci into C 2 in one step. If no 
such tuple exists, the corresponding element is zero. Moreover, from physical 
restrictions, the time evolution matrix Us must be a unitary matrix, i.e., the 
relations 

UlUs = UsUj = I 

must be satisfied, where Ug is the transposed conjugate of Us and / is the unit 
matrix. 

Thus, a configuration corresponds to a state vector and a state transition 
function S corresponds to a time evolution matrix. Therefore, a computation on 
a QTM is executed in the following way: let be a vector corresponding to a 
configuration, |'0o«t) a vector corresponding to a next step configuration, and U 
a matrix corresponding to a state transition function. Then, 

\'4^out) — U\'tpin}- 



We also denote it by 

I'^in) ^ \'4^out) ' 

Moreover, the result of a quantum computation is obtained with physical mea- 
surements (i.e., with observations of the physical systems constructing the QTM) 
as follows: when a superposition of configurations |'0) = '^iOn\ci) is written on 
the tape, we can measure (p component of ijj with probability |(:/5|'(/’)p. Especially, 
we can measure Ci component of ijj with probability |aip. 

Finally, we denote a quantum bit, the minimum unit of information. A quan- 
tum bit (or qubit) has a chosen computational basis {|0),|1)} corresponding to 
classical bit values 0 and 1 and we denote by 

|0> = and |1) = . 

For n-qubit (n > 2), we denote by 

1^1 ; ^2 j • ■ ■ j ^n) ; 

where Xi € {0,1} for i = l,2,...,n. More precisely, it is represented as the 
tensor products of n qubits. 

\xi,X 2 , ...,Xn) = |a;i) 0 \X 2 ) 0 • • • 0 \xn){= |a:i)|a: 2 ) • • • |a;„)), 
where 0 is the tensor product operator. 

3 A Polynomial Time Algorithm for Simon’s Problem 

In 1994, Simon showed a problem indicating that QTMs may be more powerful 
than PTMs, i.e., he showed that there exists a problem that can be efficiently 
solved on a QTM but can not be efficiently solved on any PTM [12,13]. 
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Simon’s problem 

Let f : {0, 1}" ^ {0, 1}™ with n < m he a function such that for any 
X x' € {0, 1}” and a nonzero s G {0, 1}", 

f(x) = fix') iff x' = x®s. 



Then, find such a nonzero s. 

Let a = (oi, . . . , a„), b = (6i, . . . , bn) G {0, 1}”. We define a > 6 by 

n n 

and we define inner product a ■ 6 of a and b modulo 2 by 

n— 1 

a • b = affi (mod 2). 

i=0 

Now, we define a transform used in the following lemma. Let Hq be g-dimensional 
Hadamard matrix {q = 2"), i.e., 




and 

Hq/2 Hq/2 
Hq/2 -Hql2 

Then, the quantum Walsh-Hadamard transform Wh is as follows: 



Hq = H2 



•Hq,2 = 



Wh, 



1 



E 

he{o,i}" 



This transform can be executed in 0{n) time on a QTM. 

We can solve Simon’s problem by obtaining a sufficient number of different b’s 
such that s ■ b = 0 (and solves a linear system of equations). Simon’s algorithm 
can obtain a sufficient number of different b’s only in polynomial time in expected 
sense. On the other hand, we show that our algorithm can obtain a sufficient 
number of different b’s in polynomial time in worst case. 

First, by Lemma 1, we can obtain a nonzero b such that s-b = 0 and g-b = 1 for 
a, g G {0, 1}". This lemma means that when we use a different g, we may obtain 
a different b. Lemma 2 and Lemma 3 certify that we can find a g for obtaining a 
different b certainly. Thus, our algorithm can certainly find a sufficient number 
of different b’s. 



Lemma 1. Let f be a function that satisfies the condition of Simon’s problem. 
Given an arbitrary nonzero g G {0, 1}”, there exists a quantum algorithm that 
returns b G {0, 1}" such that s ■ b = 0 and g ■ b = 1 if g s. Moreover, 
the algorithm runs in linear time in n and in the time required to compute f. 
Especially, when f can be computed in polynomial time of size n, the algorithm 
runs in polynomial time of size n. 
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Proof. Let 

gi(a) = max{/(a),/(a0g)} and 

02(0) = /'^ if /(a) > /(a©g), 
f 1 otherwise. 

Then, we define two operations Ug^ and Ug.^ by 
|a)|0) |a)|5i(a)) and 

|a) ^ (-l)®=i“i|a). 

If the function / can be computed in Tf{n), it is clear that each running time 
of C/gj and Ug^ is 0 {Tf{n)) time. 

Now, let us show a quantum algorithm to find b. First, we apply Wn2n to 
the first register on |0)|0) and then apply Ug.^^, where we call si (resp. S2) for 
|si)|s2) the first (resp. the second) register. 

io)io)— "2(75 i; i“)i»> 

aS{0,l}" 

^ ^ E |a)l5i(a)) 

aS{0,l}" 

Let K C {0, 1}” be a subset such that for any a G {0, 1}", 

\K n {a,a©s,a©g, a©5©s}| = 1. 

Since gi{a) = gi{a(B g) and gi{a) yf ffi(a') for any a ^ a' G K, 

XI I“)l5i(a)) = + l“® S'® 

aG{0,l}" aGiC 



Next, we apply Ug.^. fo the first register. Since (— 1)32(“) = _(^_x)S2(a®g) 

(_l) 32(a) _ (_2)ff2(a®s)^ 

X + |a ® s) + |a © g) + |a © 5 © s))|5i(a)) 

a^K 

^ 2^ ^ + |a © s) - |a © 5) - |a © 5 © s))|gi(a)). 

aGK 



Finally, we apply to the first register. From (a © /?) • 7 = (a • 7) © (/? • 7), 

we have (-l)(“®/3)-7 = (_i)a-7(_i)/3-7. Thus, 



X + |a ® s) - |a © g) - |a © 5 © s))|gi(a)) 

a^K 

1 



^X X ((-!)“"' ©(-I)®®®-' 

_(_l)©®s)-h _ (_l)(“®sffi*)®^|5)|gi(a)) 



aeK bg{o,l}" 



a&K bg{o,l}" 
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Then, by measuring the first register, we can obtain b which satisfies s-b = 0 and 
g-b = 1. Moreover, total running time of this quantum algorithm is 0{n + Tf(n)) 
time. □ 

A set B C {0, 1}" is defined linearly independent if 6 yf every 

b G B and for any subset B' C B — {b}. 

Lemma 2. Given a linearly independent set B C {0, 1}”, there exists a polyno- 
mial time algorithm that returns a nonzero g G {0, 1}” such that g ■ b = 0 for 
every b G B. Moreover, g is unique if \B\ = n — 1. 

Proof. A nonzero g G {0, 1}” which satisfies the condition of this lemma can be 
find by using a standard technique in the coding theory (e.g., see [9]). Suppose C 
is a binary linear code such that B is a basis of it. Then, the dual code of C, C-*-, 
can be defined as C-*- = {c G {0, 1}” | c • 6 = 0 for every b G B}. From [9], if B is 
given, a basis G of C'*' can be obtained in polynomial time in |i3| and n. Since 
\B\ < n, G can be obtained in polynomial time in n. Then, since G C C'*", every 
g G G satisfies the condition of this lemma. 

Furthermore, it is known that |G| = n — \B\, and C-^ = {0,(?} if G = {5} 
for some nonzero g [9]. Therefore, the nonzero g which satisfies the condition of 
this lemma is unique if |i?| = n — 1. □ 

This lemma implies that we can certainly find a nonzero s if there exists a 
linearly independent set B C {0, 1}" such that \B\ = n — 1 and s • 6 = 0 for 
every b G B. The problem is how to obtain such a linearly independent set. The 
following lemma show that we can obtain such a linearly independent set by 
selecting a good g for each application of Lemma 1 . 

Lemma 3. Let B (\B\ < n — 1) be a linearly independent set, and g G {0, 1}" 
an arbitrary nonzero element such that g - b = 0 for every b G B. Then for any b' 
such that g ■ b' = 1, B U {b'} is also linearly independent. 

Proof. Since g-{b®b') = (g • 6) © (g • b'), we have g ■ (0f,gs' b) = 0 for any subset 
B' C B. Thus, b' yf (0hgB/ b) for any b' such that g ■ b' = 1 and for any subset 
B' C B. Therefore, B U {b'} is linearly independent. □ 

By applying Lemma 1 and Lemma 2 as subroutines, the following theorem 
is obtained. 

Theorem 1. Let f be a function that satisfies the condition of Simon’s problem. 
Then, there exists a quantum algorithm that solves Simon’s problem in polyno- 
mial time in n and in the time required to compute f in worst case. Especially, 
when f can be computed in polynomial time of size n, the algorithm runs in 
polynomial time of size n in worst case. 

Proof. First, we construct a quantum algorithm for solving Simon’s problem. 

— Step 1: Set B := 0. 

— Step 2: Select a nonzero g G {0, 1}”. 
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— Step 3: Repeat the following steps while g ^ s. 

• Step 3-1: Find a nonzero b G {0, 1}” such that s ■ b = 0 and g ■ b = 1 by 
applying Lemma 1 for g, and set B := B U {b}. 

• Step 3-2: Find a nonzero g' such that g' • 6 = 0 for every h G B by 
applying Lemma 2, and set g := g' . 

— Step 4: Return g (i.e., s). 

Now, we show the correctness of our algorithm. By Lemma 3, Therefore, 
when we find the set B of size \B\ = n — 1, we can find the nonzero s because 
s ■ b = 0 for every b G B 

Finally, let us consider the running time of our algorithm. Let Tf(n) be the 
time required to compute /, and G{n) the time required by applying Lemma 2. 
Note that G(n) is some polynomial of n. The main step is Step 3 (other steps 
are at most 0{n) time). The running time of Step 3-1 is 0(n + Tfin)) time by 
Lemma 1 and the running time of Step 3-2 is 0{G(n)) time. Step 3 is repeated 
at most n — 1 times. Then, the total running time of our algorithm is 0{n? + 
nTf{n) + nG{n)) time in worst case. □ 

Note that the term nG{n) in the running time can be improved, because the 
running time of an application of Lemma 2 depends not only on n but also on 
the size of B. 

4 Conclusions 

We showed a quantum algorithm that solves Simon’s problem not in expected 
sense but in worst case. This result is not new because another algorithm to 
solve the problem in worst case [3]. However, our algorithm is much simpler 
than their algorithm. Moreover, it implies by Lemma 1 that an extended version 
of the problem in [3] (i.e., the range of the function / is extended from {0, 1}"“^ 
to {0, 1}™ for any m > n) can also be efficiently solved. 

Furthermore, by a simple modification, our algorithm can efficiently solve 
the extended Simon’s problem defined in [4]. 

Extended Simon’s problem [4] 

Let f : {0, 1}” — *■ {0, 1}™ with n < m be a function such that for any 
X ^ x' G {0, 1}" and a set S C {0, 1}", 

f{x) = f{x') iff x’ = X (B s for some s G S. 

Then, find a basis of S. 

This problem can be given in such a way that the size of S is unknown. 

Shor showed that there exists polynomial time quantum algorithms for fac- 
toring and finding discrete logarithm, but his algorithm is also constructed in 
expected sense[10,ll]. Therefore, in this related work, the most interesting open 
problem is whether there exist a polynomial time quantum algorithm for fac- 
toring or finding discrete logarithm in worst case. Our result may help to solve 
these open problems. 
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Abstract. In this paper, we study the possibility of Occam’s razors 
for a widely studied class of Boolean Formulae : Disjunctive Normal 
Forms (DNF). An Occam’s razor is an algorithm which compresses the 
knowledge of observations (examples) in small formulae. We prove that 
approximating the minimally consistent DNF formula, and a generaliza- 
tion of graph colorability, is very hard. Our proof technique is such that 
the stronger the complexity hypothesis used, the larger the inapprox- 
imability ratio obtained. Our ratio is among the first to integrate the 
three parameters of Occam’s razors : the number of examples, the num- 
ber of description attributes and the size of the target formula labelling 
the examples. Theoretically speaking, our result rules out the existence 
of efficient deterministic Occam’s razor algorithms for DNF. Practically 
speaking, it puts a large worst-case lower bound on the formulae’s sizes 
found by learning systems proceeding by rule searching. 



1 Introduction 

The learnability of Disjunctive Normal Form formulae (disjunctions of conjunc- 
tions) is a central problem in machine learning [27]. In 1984, Valiant studies the 
learnability of this class, and remarks that “t/ie attraction of this class is that 
humans appear to like it for representing knowledge as is evidenced, for example, 
by the success of the production system paradigm and of Horn clause Zo^ics” [35]. 
He proves that a subclass of DNF is learnable, and leaves as an open problem 
whether the whole class is learnable. Since then, many theoretical studies have 
investigated the learnability of DNF or subclasses [19,27,2,1,4,5,8,15,20,23,28]. 

Simultaneously, many programs for machine learning were designed to learn 
efficiently from examples. In each of these programs, the algorithms has access 
to a learning sample and tries to build a small function approximating as best 
as possible the observed examples. According to [10], systems that learn sets of 
rules (DNF in the Boolean framework) have a number of desirable properties : 
they are easy to understand, they can outperform decision-tree learning algo- 
rithms on many problems, they have a natural and familiar first-order version 
(Prolog predicates), and techniques for learning propositional rule sets can often 
be extended to the first-order case [29]. Many learning algorithms either build 
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rules, or have a stage consisting in searching for rules that are postprocessed, 
or are aimed at producing formulae that can be easily translated into rules, 
[7,10,9,12,21,24,25,26,31,30,29,32,33,36] and many others. 



Practical and theoretical results often focus on an aspect of approximation 
they both share : given a set of examples {e.g. description of animals for which 
we know if they have or not some illness), can we devise an efficient algorithm 
which can find a small formula (i.e. a mean of classifying observations) consis- 
tent with all examples (making no errors)? 

Theoretically speaking, this aspect is often related to the principle of Occam’s 
razors [6]: an Occam’s razor for a class {i.e. a set) of concept representations C 
(each of which is a function mapping observations to classes) is an algorithm 
that, given a learning sample LS whose labels are given by some unknown tar- 
get concept t G C, can produce in time polynomial in |T5'|, n (the number 
of description variables), jtj a formula h G C satisfying to the two following 
conditions : h is consistent with LS (it does not make errors) and has size 
satisfying \h\ < \LS\°'(n\t\)^ (with 0 < a < 1 and b > 0). The principle of Oc- 
cam’s razors [18] states that in order to learn, a system should compress the 
information contained in the examples. This principle was originally stated by 
philosoph William of Occam (1285-1349), and led to theoretical results in the 
PAC-learning model of Valiant [34] : learning is in fact equivalent to finding 
Occam’s razors [18]. 

Practically speaking, machine learning algorithms producing rules are almost 
always aimed at producing small sets of rules, because they are easy to under- 
stand for the non-expert, and they appear to be sufficient on many problems [17]. 



The principal result on the inapproximability of DNF comes from [19]. They 
show that DNF is as hard to approximate as a proglem related to a generaliza- 
tion of Graph Colorability (which does not have commonpoints with ours). [13] 
prove that Graph Golorability is hard to approximate to within (VO < (5 < 1). 
Using the reduction of [19], we are able to show that the consistent DNF with 
minimal size is not approximable (in size) to within (VO < i5 < 1), where n is 
the number of description variables of the examples, a measure of the problem’s 
complexity. In this paper, we first prove that the upperbound of S can be re- 
moved : the result holds in fact Vi5 > 0. We go further into negative results, and 
prove that size-|t| DNF cannot be approximated by DNF having size not greater 
than a, 6, c are any constants satisfying > a > 0, 6 > 0, and 

1 -I- > c > 0. \t\ is the number of monomials (conjunctions) of the target con- 

cept. This proof is stated under the hypothesis NPt^ZPP, where ZPPdenotes 
the class of langages decidable by a random expected polynomial-time algorithm 
that makes no errors [3] . 



In order to achieve our result, we firstly prove an equivalence of approximat- 
ing DNF with a generalization of graph colorability. We then prove our result 
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on the inapproximability of DNF by proving an inapproximability result on the 
generalization of Graph Golorability. 

2 Equivalence between Approximating DNF and a 
Generalization of Graph Golorability 

In this paper, we are interested in approximating optimization problems. An 
optimization problem contains an instance, the definition of a feasible solution, 
and a cost function defined for any feasible solution. The aim of any approxima- 
tion algorithm is to find feasible solutions whose cost {e.g. number of colors for 
coloring a graph) is as close as possible from the problem’s optimum. We also 
use the cost notion for instances : the cost of an instance is the optimal cost 
among all feasible solutions for this instance. 

Let iF be a class of Boolean formulae; any of its elements, /, is a function 
/ : {0,1}" — > {0,1}. An element x G {0,1}" is an example. It is composed 
of n binary variables {xi, Xn} assigned in {0,1} (the negative and positive 
literals). The value f{x) is the class € {0, 1} that / assigns to x (the negative and 
positive class). The size of any formula / is denoted |/|. We investigate the class 
of DNF, set of formulae described as a disjunction of monomials. A monomial is 
a conjunction (A) of literals (a literal is a Boolean descriptor, taking value either 
True or False). We are interested by the possibility, for some efficient (Ptime) 
algorithm, to approximate the following minimization problem: 

— Name : Opt(DNF) 

— Instance : A learning sample LS 

— Feasible Solutions : Formulae from DNF consistent with LS 

— Cost Function : Size of the formula (number of monomials) 

It is well-known [19] that this problem is as hard as the problem Opt 
(Independant-set cover) (this is the same as the graph-coloring problem [14]; 
however, this name is convenient for our proofs). We show in this paper that it 
is in fact as hard as a generalization of the Opt (Independant-set cover) problem 
(for any integer k > 0, [k] denotes the set {1, 2, ..., k}): 

Definition 1 Opt(Multi independent-set cover) 

— Name ; Opt(Multi independent-set cover) 

— Instance .■ G® = (A®, A®), a graph presenting the following form : for 
some positive integer d, X® is partitionned into Ai, ..., Xd such that if d> 1, 
VI < i < j < d, none of the edges between Xi and Xj are in E® . 

— Feasible Solutions .■ a cover of X® in subsets si, ...,Sfc such that : 

1. VI < i < /c,Vl < j < d, Si C\ Xj yt 0 and induces an independent set 
in Xj. 

2. y{xi,X2,...,Xd) G G [k] :\/l G [d],x/ G Sj 

— Cost Function .■ k, that is, the number of subsets used to cover A®. 



240 



Richard Nock et al. 



Note that every graph can be represented according to the preceeding definition, 
but the uniqueness of the representation is not ensured for a given graph. The 
cost of a graph G instance of Opt(Independant-set cover) is usually written x(G). 
We note Xg(G®) as the cost of a graph G® instance of Opt(Multi independent- 
set cover). The following proposition states the equivalence between these two 
problems (proof omitted due to space limitations). 

Proposition 1 Equivalence of approximating Opt(Multi independent- 
set cover) and Opt(DNF) : For any graph G® = (X®,E®) instance of 
Opt (Multi independent-set cover), we can create in time polynomial in |X®| 
a set of examples LS such that if there exists a feasible solution to Opt (Multi 
independent-set cover) whose cost is k, then we can create in Ptime a DNF hav- 
ing no more than k monomials and consistent with LS . Reciprocally, if there 
exists a DNF of size k consistent with LS, then (i) we can suppose without loss 
of generality that it is monotonous (no negative literals), and (ii) we can gener- 
ate in Ptime a feasible solution to Opt(Multi independent-set cover) whose cost 
does not exceed k. 

3 Opt(Multi Independent-Set Cover) is Hard to 
Approximate 



This section is devoted to the proof of the following theorem. 

Theorem 1 Unless NPQZPP, Opt(Multi independent-set cover) is not ap- 
proximable to within 



P = {l^*l + l^il} 

V l<i<n 



da 



dmax{|E,|} (x.(G®))^-^ 

l<2<n 



where Gi = {Xi,Ei) is the subgraph of G® induced by Xi. The result holds 

- V6 > 0, 

- > a > 0, 

- Vl + ^>c>0. 

This theorem means that no polynomial-time (Ptime) algorithm can guarantee 
to find, from an instance G® of Opt(Multi independent-set cover), a solution 
whose cost does not exceed Xg(G®) x p®. The proof technique basically relies 
on multiplying d instances (or stairs) of Opt(Independant-set cover) to form an 
instance of Opt(Multi independent-set cover), without linking each stair (graphs 
instance of Opt(Independant-set cover)) to the others. Ideally, we would like to 
obtain a relationship such as 



Xg(G®) = xioy 



( 1 ) 



which would ease a lot the proof since it would also blow up any inapproximabil- 
ity ratio p for Opt(Independant-set cover) to p® = p‘^ for Opt(Multi independent- 
set cover). However, this relationship is not true for any graph instance of 
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Opt(Independant-set cover), that is why we need to build particular, very hard to 
solve instances of Opt(Independant-set cover). Figure 1 presents the building of 
a hard instance G of Opt(Independant-set cover), from two reductions t\ and T 2 - 
Reduction Tg is the stacking-up of d instances of Opt (Independant-set cover) to 
form an instance of Opt(Multi independent-set cover). The main theorem we 



Instance of Opt(Independant set cover) 
\ 

\ 

^ d 

Tl T2 \ T§ 

“SAT’’ - ■ o G" c- G' D=- G P=> G® 

/ 

/ 

Instance of Opt(Independant set) 

Fig. 1. Scheme of the reduction n o t 2 o Tg . 



need in this part is the following: 

Theorem 2 (From [16]) Unless NPQZPP, VO < 5 < 1, Opt (Independent set) 
is not approximable to within p" = n{G"Y . 

We suppose for the sake of simplicity, as it is pointed out in [13,16], that theo- 
rem 2 is proven from the decision problem “SAT” . It means that there exists a 
reduction from “SAT” to Opt(Independent set) such that 

— Any graph G" obtained from a satisfiable instance of “SAT” satisfies 
a(G") = g. 

— Any graph G" obtained from an imsatisfiable instance of “SAT” satisfies 
a(G") < g/n{G"Y, VO < (5 < 1. 

We now describe the two first reductions, ti and T 2 - Define as Kr the complete 
graph over r vertices. 

Definition 2 ti [22] From G" , we create the graph product G' = Kj. x G" with 
r > a{G") as follows. Each vertex of G' is a pair < i,v > where i = 1, ...,r is 
a vertex from Kr and v is a vertex from G" . Sets of vertices with the same first 
component induce a clique in G' . Two vertices < i,v >, < j,w > from different 
cliques are adjacent iff their second components v and w are equal or adjacent 
vertices in G" . 

(In that follows, we fix r = n(G")). As pointed out in [22], Section 2.1, we have 
a{G") = a(G'). Let Zp be the field of integers modulo p. 

Definition 3 T 2 [22] Let G' he a graph whose vertices are partitionned into 
cliques Gi, ...,Cr- Let p he a prime at least as large as 



maxjmax |Gi|, r/a(G'), ^/r} 
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The image of G' by T 2 , G, has vertices described as quadruplets < i,k,y,w >, 
where i = k € Zp, y € Zp, and w G Zp. Let ua =< iA,kA,yA,WA > 

and ub =< iB,kB,yB,WB > be two vertices of G. The two vertices are not 
adjacents iff (3s G Zp)(3z G Z^) such that: 

— < iA, kA — s >G G' , < iBi kB — s >G G' and they are not adjacent. 

- WA = {kA - s)yA + z and wb = {kB ~ s)yB + -J. 

Like [22], we apply transformations ti and to any graph G" instance of 
Opt(Independent set). We have 

Proposition 2 G satisfies: 

1. n{G) = rp^ ([22], part 2.1) 

2. a{G) =p^a{G') ([22], corollary 2.2) 

3. If G is built from a satisfiable instance of “SAT”, then x(G) = n{G)/a{G) 
([22], theorem 2.3) 

4- P^X{G) < n{G) 

(proof of [4] omitted due to space limitations). The third reduction, with 
d > 0 an integer, simply consists in stacking-up d times G without linking each 
“stair” to the others. Let G® = t^{G). Since there are n{G)‘^ d-tuples containing 
one vertex from each Xi, ..., X^, since any set from any solution to Opt(Multi 
independent-set cover) contains at most a{G)‘^ of these d-tuples, and since any 
of these d-tuples are covered, it comes Xg(G®) > Furthermore, making 

the d-times cross-product of the independent sets of a solution to Opt(Graph col- 
orability) lead to a feasible solution to Opf(Multi independent-set cover) whose 
cost satisfies Xs(C®) < x{G)‘^- Consequently, 



We refine these inequations. Any graph G® corresponding to a satisfiable in- 
stance of “SAT” leads by proposition 2 to x{G) = n{G)/a{G). Therefore, 
Xg(G®) = x{G)‘^- Any graph G® corresponding to an unsatisfiable instance 
“SAT” leads by theorem 2 and proposition 2 to VO < d < l,Xg(G®) > 
{p^n{G”Y)'^ . From proposition 2, we also get VO < d < l,p^n(G")'^ > 
^ 'n{G)/p^ = r = n{G"). Therefore VO < d < l,p^n(G")^ > 

■ Therefore, for any graph G® corresponding to instances of “SAT” 
either satisfiable or not, we have: 



This relationship is central for our proof; although it is much weaker than equa- 
tion 1, it is still sufficient to prove theorem 1. However, there are two problems 
left : how can we use 3 to prove p®, and can we choose d constant, so that T 3 is 




(2) 




(3) 
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Ptime ? We first solve the first problem. From any “SAT” instance (theorem 2) 
transformed in a graph G using reductions t\ and T 2 , depending on whether it is 
satisfiable or not, there exists 5 > 0 (it is a function of the “SAT” instance, [22]) 
such that either x(G) = p^, or x(G) > p^n{G")^, VO < (5 < 1. 

For any satisfiable instance of “SAT”, we get from inequations 3 : Xg(G®) < 
x(G)'^ = {p^Y ■ For any unsatisfiable instance of “SAT”, we get from inequa- 
tions 3 : Xg{G®) > ^ {p^Y ^ {n{G"Y^~Y‘^ . In order to prove 

theorem 1 , it is sufficient to show a “gap” greater than p® between {p^Y 
(p^)'^ X {n{G''Y^~Y'^ ■ That is, we need to find d such that 



{n{G”Y^~Y'^ > p® ^ d > 



log p® 



log(n(G")2^“i) 

under the constraint 5 > 1 / 2 . And we need to show that d is constant. 



(4) 



We now check the constant value for d satisfying inequation 4. [22] choose 
for p a prime at least as large as max |maxi |Gi|, ^{G") ’ V^}’ "'^hich is n{G") in 
our case. We also have 

Proposition 3 [ ] There exists a constant 0 < a < 1 such that for any 

positive integer M, there exists a prime falling in the interval [M;M^®“]. Fur- 
thermore, we can fix a = 11/20. 

In our case, we can therefore suppose that n{G”) < p < n(G")^®“. We fix 
||<(5<lifc<l, and |§+ < <5 < 1 otherwise (remark that 4 < d < 1 ). 

Fix 

/(a,6,c,a,i5)-b(6-b5a)6 

d=\ 1 (5) 



with 



f{a,b, c. 



a,S) = 2 X 

X 



1 

a(12-|-10c()-|-max{0;(c-l)(4-|-3ct)} 
(6+5a)b I T 
25-1 ^ 



1 

25-1 




Arithmetic calculation gives 



Fact 1 The choice of 6 leads that f{a,b,c,a,S) is positive. Furthermore, 



f{a, b, c, a,S) — d (a(12 -b 10a) -b max{(c — 1)(4 -b 3a); 0}) > 0 

Fact 1 leads to d’’ < at least for suffi- 

cient large-sized graphs. Proposition 2 leads to 

Xg(G®) < x(GY < ^ n{G'YG+3^^ 



( 6 ) 
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The fact that vertices of G can be partitionned into independent sets of size 
(proposition 2) leads to (for convenience, we fix e(G) and n{g) to be respectively 
the number of edges and the number of vertices of G) : e{G) < (rp^- 2 )i 2 i ~ 

rp {rp - 1 ) ^ ^ Ti (G) = rp^ < n(G")®+®“, we get e(G) + 

n(G) < HlIiZ — li _|_ therefore e(G) + n{G) < r'^p^^ = n(G")^^+^°“. 

Putting it altogether, we get that 

P® = f {|T;,| + |Pi|}^ max {|14|}") (Xa(G®))^”^ 

yi< 2 <n / \ l<i<n J 

implies whenever c > 1 



p® < 77,((^"yi2+10a)ad ^ ^^Q//'jf(a,b,c,ct,5) — {12+10a)ad—d{c—l){4+3a) 
Xn(G")^®+®“^'' X „(G")^(^-1)(4+3«) 



and therefore p® < When c < 1, similar calculation 

leads again to p® < We get 

logp® ^ /(a,6,c,a,i5) + (6 + 5a)6 
log(n(G")2^-i) 2i5-l 

But the choice of d also gives 

/(a,6,c,a,<5) + (6 + 5o;)6 
25^1 - ^ 

d is therefore constant, and satisfies inequation 4. The proof of theorem 1 is com- 
pleted. From proposition 1, from the fact that |LS'| < (maxi<i<„{|i?i| -|- |14|})'^, 
n < dmaxi<j<„{|Pi|}, and the target concept’s size satisfies \t\ = Xg(G®), we 
obtain 



Theorem 3 Non-approximability of DNF Unless N P^ZZPP , there cannot 
exist an Occam’s razor for DNF finding formulae whose size does not exceed 
\LS\°'n^\tf’ , where |t| is the size of the target concept, and \LS\ the size of the 
learning sample. The result is true even if we suppose that the target concept 
belongs to monotone-DNF. a, b, c are any constants satisfying: 

— >a>0; 6>0; 1-1- — - > c > 0 



4 Conclusion 

Recall that an Occam’s razor for a class of Boolean formulae G is an algorithm 
that, given a learning sample LS whose labels are given by some unknown target 
concept t G G, can produce in time polynomial in ILS”!, n, |t| a formula h € C 
satisfying to the two following conditions : h is consistent with LS and \h\ < 
\LS\°'{n\t\)^, with a, 6 > 0 and a < 1. With our reasonable complexity hypothesis. 
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our result does not rule out any possibility of deterministic Occam’s razors, even 
if efficient Occam’s razors are proven impossible. However, the advantage of 
our reduction technique is that, the higher the time alloted for T 3 (thus, the 
higher d), the higher the non-approximability ratio. [27] cite the complexity 
hypothesis A^P^DTIME(poZy(2"'')) (for some e > 0). With such hypotheses, 
adaptated to handle zero-error, probabilistic algorithms, it would be interesting 
to see how close to the Occam requirements the non-approximability ratio would 
come. 
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Abstract. We locate the complexities of evaluating, of inverting, and 
of testing membership in the image of, morphisms h : S* A*. By 
and large, we show these problems complete for classes within NL. Then 
we develop new properties of finite codes and of finite sets of words, 
which yield image membership subproblems that are closely tied to the 
unambiguous space classes found between L and NL. 



1 Introduction 

Free monoid morphisms h : S* — > A* , for finite alphabets S and Z\, are an 
important concept in the theory of formal languages (e.g. [6,11]), and they are 
relevant to complexity theory. Indeed, it is well known (e.g. [7]) that NP = 
Closure{<^° , HOMn.e.) C Closure{<^° , HOM) = R.E., where <^° denotes 
many-one AC°-reducibility, HOM (resp. HOMn.e.) is the set of morphisms 
(resp. nonerasing morphisms), and Closure denotes the smallest class of lan- 
guages containing a finite nontrivial language and closed under the relations 
specified. Morphisms and their inverses also play a role in studying regular lan- 
guages and “small” complexity classes: regular language varieties are closed un- 
der inverse morphisms [6], and the replacement of morphisms by “polynomial 
length M-programs”, in the definition of recognition by a finite monoid M, al- 
lows automata to capture many subclasses of NC^ [3,5,13]. 

Here we consider the complexity of evaluating inverse morphisms and mor- 
phisms h : E* —f A*. Specifically, we consider the simple problem eval of 
computing the image of a word v under h, the problem range of determining 
whether a word w G h{S*), and the problem inv of computing an element 
of h~^{w) given w € h{S*). We examine the fixed setting, in which the mor- 
phism is input-independent, and the variable setting, in which the morphism is 
defined as part of the input. 

The general framework of our results is summarized in the following fig- 
ure. In the fixed case, the eval problem characterizes the relation between the 
classes NC°, AC'’ and TC'’, and the problems range and inv are closely re- 
lated to the class NC’^. Membership of inv in NC’ is then to be contrasted with 
Hastad’s result that there exists a fixed NC°-computable function whose associ- 
ated inversion problem is P-complete [10]. We also observe that 
Closure{<^^ ,HOM~^) = TC”, where HOM~^ denotes inverse morphisms, 

* On sabbatical leave from the Universite de Montreal until August 1998. 
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yielding yet another characterization of this important subclass of NC^. In the 
variable case, the problem eval remains in TC° while the range and inv prob- 
lems capture complexity classes between L and NL. 



Problem 


Eixed setting 


Variable setting 


evaluation 


isometric: NC'^ 
nonisom.: TC°-complete 


isometric: AG*^ 
nonisom.: TG°-complete 


range 


any h\ NC^ 

chosen h: NC^-complete [1] 


h{E) prefix code: L-complete 
unrestricted h-. NL-complete 


inversion 


NG'^ 


(Functional-L)^'' ^ 



Here we do not distinguish between a circuit-based language class and its 
functional counterpart. A morphism h : S* A* is isometric iff h applied to 
each a S 17 yields a word of the same length. 



An important part of our results, motivated by recent interest in classes interme- 
diate between L and NL [2,12,14], is the investigation of the problem range in 
the variable case. Restricting the underlying morphism affects the complexity 
of the range problem, e.g. the range problem for prefix codes is in L and is 
complete for this class. Now, one might expect that imposing the code property 
on h{S) should render the range problem complete for an unambiguous logspace 
class. However the resulting problem remains NL-complete. We therefore de- 
velop properties of codes and of sets of words, in particular the stratification 
property (see Section 4.2), which yield range subproblems of complexity iden- 
tical to the complexity of the graph accessibility problems introduced to study 
unambiguous logspace classes (see Section 2). In particular, we show that the 
problem range in which h(S) is a stratified code is many-one equivalent to the 
GAP problem Lgtu capturing StU SPACEifogn) , and that a variant of range in 
which h{E) is a stratified left partial code is RUSPACE{logn)-complete. This is 
particularly interesting since RUSPACEifogn) was only recently found to have 
a complete problem [12]. 

Due to space restrictions many of our constructions cannot be given in detail. 
In particular, the proofs of corollary 3 and theorems 4, 7, 8, 9, and 10 have to 
be postponed to a full version of this paper. 

2 Preliminaries 

2.1 Complexity Theory 

We assume familiarity with basic complexity theory. In particular, recall NC° C 
AC° C TC° C NCi C L C StUSPACE{logn) C RUSPACE{\ogn) C UL C 
NL CPC NP, where UL is the set of languages accepted by logspace Turing 
machines which are nondeterministic with at most one accepting computation, 
RVSPACEifogn) is defined like UL with the stronger condition that, on any 
input, at most one path should exist from the initial configuration to any ac- 
cessible configuration y, and StUSPACEifogn) is defined like UL with yet the 
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stronger condition that, between any pair of configurations (x,y), at most one 
path should exist from x to y [12]. Furthermore, Functional-L, also denoted FL, 
is the functional counterpart of L, i.e. the set of functions computable by de- 
terministic Turing machines and FL-^^ is the set of functions computable by a 
deterministic Turing machine M having access to an NL-oracle. 

In the usual way, DLOGTIME uniformity of a circuit family refers to the 
ability for a Turing machine equipped with an index tape allowing direct access 
to its input to recognize in time O(logn) the extended connectivity language of 
a circuit. For precise details on circuit descriptions, see [4]. 

Just as GAP is NL-complete and outdegree-one GAP is L-complete [9], the 
obvious GAP problems Lstu, Lm, and L„, which are respec- 
tively StUSPACE(logn)-]ia,Td, RUSPACE(logn)-ha,rd, and IZGhard, were intro- 
duced for topologically sorted graphs 

in [2,12]. Lru is RUSPACE{logn)-complete [12] while Lgtu is not known to belong 
to StUSPACE{logn). Using the closure under complement of NL L„ is already 
NL-complete. 

We will make use of the reducibilities and , which refer to many- 

one and Turing AG° reducibilities respectively. 

2.2 Problem Definitions 

Fix a morphism h : E* ^ A* for finite alphabets E and Z\, with E assumed 
ordered. In the fixed setting, the three problems of interest in this note are: 

eval(/i) Given v € E*, compute h{v). The decision problem has b G E and 
j G N as further inputs, and asks whether b is the jth symbol in h{v). 
range (/i) Given w G A* , determine whether w G h(E*). 

inv(h) Given w G A*, express w as h{ai^)h{ai^) ■ ■ ■ h{ai^)w' such that, first, 
[w'l is minimal, and second, v := ■ ■ ■ ai^. is lexicographically minimal 

with respect to the ordering of E. The decision problem is obtained by adding 
j G N as an input parameter, and asking for the jth bit in the representation 
of w. 

In the variable setting, the three problems of interest are eval, range, and 
inv, defined as above, except that the alphabets E and A, and the morphism h, 
now form part of the input. 

Fix a finite alphabet E. Let v = aitt 2 ■■■ an, Ui G P,n > 0. The length of v is 
written |?;|, and #a('y) represents the number of occurrences of a G U in For 
0 < i < j E n, we define iVj as a^+i • ■ ■ aj. In particular, i-\Vi is if i > 1. 

We encode our problems, in both the fixed and the variable settings, as 
binary strings. In the fixed setting, a word v over an alphabet P is encoded 
as a sequence of equal length “bytes”, each representing a letter in E. By the 
predicate Qa{v,i), a G E, v G E*, we mean that i-iVi is a (with Qa{v,i) false 
when i > |ri|). In the variable setting, we write Qa{v, i) as Q{a, v, i) in view of the 
input-dependent alphabet E. Any encoding which allows computing Q(a,v,i) 
in AG'’ suffices. 
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To encode a morphism h : E* A* (only required in the variable setting), 
it suffices that the predicate associated with h, denoted H{b, a,j), where b G A, 
a G E, and j > 0, meaning that j-\h{a)j is b, be AC°-computable. For 
h : E* ^ A*, we define max(/i) as max{|/i(a)| : a G E}. The predicate H{b, a,j) 
extends to H(b, v,j) for v G E* in the obvious way, and computing this extension 
is the object of the eval problem. We define H{b,v,j) to be false if j > |/i(r’)|- 

3 Fixed Case 

Throughout this section we fix ft. : A* ^ Z\*. We write the associated predicate 
H{b,v,j)^ b G A, V G E*, j G N as Hb{v,j), in view of the fixed alphabet E. 

3.1 Evaluation 

The complexity of evaluating ft is low, but it depends in an interesting way on 
whether ft is isometric^ i.e. whether the length of any h(w) only depends on |w|. 

Proposition 1. eval(ft) e Functional- NCP if h is isometric. 

Proof. Let ft be isometric and let c = |ft(a)| (= max(ft)) for any a G E. Then, 
for V G E*, b G A, and j G N, we have H}j{v,j) ^ (3a G E)[Qa{v., \j/c\) A 
p[b{a,j mod c)]. Since E is fixed, the existential quantification over a can be done 
in NC°. Hence, for each j , 1 A j A c-\v\, there is an NC° subcircuit Cj computing 
the jth symbol in h{v). The circuit for eval(ft) is a parallel arrangement of 
these Cj, 1 < j < c- |u|. Uniformity will be argued in a full version of this paper. 

It follows that, when ft is isometric, the decision problem eval(ft) G AC'’. 
Interestingly, the converse also holds: if ft is not isometric, then eval (ft) ^ 
Functional- AC*’ and the decision problem eval(ft) ^ AC®, as follows from: 

Theorem 1. If h is non-isometric, then the decision problem eval(h) is TCP- 
hard under . 

Proof. Since ft is not isometric, there exist a,b G E, s,t G N, such that |ft(a)| = s 
and |ft(ft)| = s -I- t with t > 0. We claim that MAJORITY, i.e. the language of 
all words v over the alphabet {a, b} having \v\/2 < #b(w), -reduces to com- 
puting the length of h{v). This implies that the extended predicate Hb{v,j), 
V G E*, is TC'’-hard, because |ft(w)| is trivially expressed as the largest j, 
1 < j < kl ■ max(ft), such that \J Hb{v, j). And computing the extended 
predicate Hb{v,j) is precisely the decision problem eval(ft). 

To prove the claim that MAJORITY reduces to computing |ft(r>)|, note that 
|^'|/2 < ffb{v) iff s • |v| -l-t • |n|/2 < s ■ \v\ -\- 1 ■ ffb{v) ■ Now the left hand side of the 
latter equality can be computed in AC® because s and t are constants. As to the 
right hand side, it is precisely |ft(r;)| = s-ffa{v)-\-{s-\-t)-ffb{v) = s- \v\-\-t-ffb{v)- 
Hence one can test \v\/2 < ffb{v) in AC'’, once an oracle gate provides the value 
of |ft(t’)|. Hence MAJORITY -reduces to computing |ft(f)|. 
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Corollary 1. The decision problem eval(/i) is in ACP iff h is isometric. 

On the other hand, it can be shown that even a variable morphism can always 
be evaluated in TC°, see subsection 4.1. Hence, the reduction from MAJORITY 
to eval(/i) exhibited in Theorem 1 can in fact be reversed: 

Theorem 2. For each morphism h, the decision problem eval(ft.) G TCP. 

Corollary 2. If h is a non-isometric morphism, then the decision problem 
eval(ft.) is TCP -complete under . 

Corollary 3. TCP = Closure{<^^ ,HOM~^). 

3.2 Range Membership 

We now turn to the problem of testing whether a word w G A* belongs to h{E*). 
Since E* is regular and the regular languages are closed under morphisms, h{E*) 
is regular. Hence range(ft,) G NC^. On the other hand, Schiitzenberger in 1965 
constructed the following family of finite biprefix codes over the binary alpha- 
bet {a, 6}: Cn := {a’^, a^~^ba, a’^~^b, ba"~^} U {a^ba^~^~^ j 1 < i < n — 3}. 
Schiitzenberger [16] proved that, for each n, the symmetric group S'„ divides 
(i.e. is an epimorphic image of a submonoid of) the syntactic monoid of the 
Kleene closure C* of C„. Using Theorem IX. 1.5 of [17] we get the following 
result. Again, details are left to to a full version of this paper. 

Theorem 3. For every morphism h, range^"/!^ G NC^. Furthermore, there ex- 
ists a morphism h such that range(h) is NC^ -complete under <m^ ■ 

3.3 Inversion 

Recall the definition of problem inv(/i). By using inv(/i) to determine whether 
[w'l > 0, the decision problem range(/i) reduces to the decision problem inv(ft.), 
under if an obvious encoding is chosen. Hence there is an h such that the 

decision problem inv(/i) is NC^-hard (by Theorem 3). Our goal in this section is 
to show that inv(/i) is in Functional-NC^, which holds, clearly, iff the decision 
problem inv(ft.) is in NC^. 

Suppose that a word w G h{E*). Then if h{E) were a code, i.e. if it had 
the property that any word in h{E*) is uniquely expressible as a sequence of 
elements from h{E), then computing h~^{w) would simply require computing 
all the positions i,f<i< [w], at which w splits into w = aj3 with a, /3 G h{E*). 
This set of positions would uniquely break w up into words from h{E). Since 
these positions can be computed in NC^ because h{E*) is a regular language, 
this strategy would solve such instances of inv(/i) in NC^. Interestingly, we can 
combine this strategy with a greedy approach and solve the general case in 
which h{E) is not a code. For lack of space we have omitted the proof of the 
next theorem. 

Theorem 4. For every morphism h, \rw(h) is in Functional-NC^ . 
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4 Variable Case 

4.1 Evaluation 

The fixed case construction for eval(h) carries over to the variable case eval. 
We must compute the predicate H(b, v,j), i.e. determine whether the jth symbol 
in h{v) is b. In the following let 7r(?;) be the Parikh vector of a word v and Mh 
the growth matrix of the morphism h, i.e. {Mh)ik is the number of bk symbols 
in h{ai). Then, H{b,v,j) holds iff there exists i G N and a G S such that 
Tr{oVi-i)Mhl < j < ^^{ov^)Mhl and Q{a,v,i) and = b. 

Now the computation of Mh out of h and the computations of 7r(or;i-i)M;il 
and 7r{oVi)Mhl out of v can be done in TC°. Indeed, the integers involved 
in these computations are small, i.e. their absolute values are polynomial in 
the input size, aird arithmetic with such integers can even be carried out in 
AC° [4, Lemma 10.5]. But the computation of the Parikh vectors needs couirting 
and is TC°-hard. Hence computing H{b, v,j) is TC°-hard aird eval e Fuirctioiral- 
TC°, which we record as: 

Theorem 5. The decision problem eval is TCP -complete w.r.t. . 

We note that eval restricted to isometric morphisms is in AC'’. 



4.2 Range Membership 

Our problem range is precisely the Concatenation Knapsack Problem considered 
by Jenner [8], who showed that: 

Theorem 6. (Jenner) range is NL-complete. 

Recall that a nonempty set C C A* is a code if C freely generates C* , and 
that C is a prefix code if C has the prefix property, i.e. if no word in C has a 
proper prefix in C. Define prefixcoderange to be the range problem restricted 
to input morphisms h : E* A* such that h{E) is a prefix code. The easy proof 
of the following will be given in a full version of this paper. 

Theorem 7. prefixcoderange is L-complete. 

In view of Theorems 6 aird 7, it is natural to ask for properties of h{E) 
allowing the range problem to capture concepts between L and NL like symme- 
try or unambiguity. For instance, one might reasonably expect codes to capture 
rmambiguity, and thus the obvious problem co derange to be complete for one 
of the unambiguous space classes between L and NL. But the mere problem of 
testing for the code property is NL-hard as shown by Rytter [15]. In the follow- 
ing subsection, we introduce a more elaborate approach which is able to capture 
unambiguity. 
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4.2.1 Stratified Sets of Words and Morphisms 

Definition 1. a) Let C C A* he arbitrary. Define a relation pc ^ A x A as 
O' Pc b iff some word in C eontains ab as a subword. 

b) C is said to be stratified if pc forms a unique maximal aeyclic 
ehain oi pc 02 pc ■ ■ ■ PC On/ in that case, the word a{C) := ai02 ■■■ an & A* 
is called the stratification ofC. (Example: The set {a,b,ab,bc} C {a,b,c}* is 
stratified, while {a,b}, {ab,ba} and {ab,ac} are not.) 

Let C G A* he stratified, where we assume from now on that such a stratified 
set makes use of all the letters in A. Clearly, any word x G C is a subword 
of cr(C'). Then, any word w ^ A* \s uniquely expressible as w = 0102 . . . Ofc, 
where each ai is a maximal common subword of w and cr(C). 

Proposition 2. Let C C A* be a stratified set. 

a) For any w € A* , w € C* iff its canonical decomposition w = a\a2 ... ak 
into maximal subwords common with a{C) satisfies ai G C* for each i. 

h) C is a code iff each subword of its stratification a{C) is expressible in at 
most one way as a concatenation of words from C . 

Testing whether a finite set C C A* is stratified is L-complete. On the 
other hand, although stratification may seem like an overwhelming restriction, 
stratifiedrange, i.e. the range problem for stratified h{S), is NL-hard: 

Theorem 8. stratifiedrange is NL-complete. 

Proof. To see that stratifiedrange is in NL, we first test deterministically in 
log space whether C is stratified. Then we use the fact that range € NL. The 
construction used to show NL-hardness is strongly inspired by a proof in [7] but 
has to be omitted because of lack of space. 

Hence, like coderange, but for a different reason, stratifiedrange is NL- 
complete and thus does not appear to capture an unambiguous logspace class. 
It is stratifiedcoderange, namely the problem range restricted to the case 
of a stratified code h{S), which bears a tight relationship to StUSPACEflogn). 
Again, we have to postpone the proof of the following theorem to a full version 
of this paper. 

Theorem 9. stratifiedcoderange and Lstu are many-one logspace equivalent. 

Hence stratifiedcoderange is StUSPACE{\ogn)-hai:d, and although we are un- 
able to claim a complete problem for StUSPACEifogn), we have come very 
close, since even the StUSPACE{\ogn)-hard language Lst«, specifically tailored 
to capture StUSPACEilogn), is only known to be in RUSPACEilogn) [12]. On 
the other hand, we can claim a complete problem for the unambiguous class 
RUSPACEilogn), as follows. 

Proposition 2 states that the code property of a stratified set C translates 
into the unique expressibility of every expressible subword of the stratification 
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of C as an element of C* . Let us relax this condition and define a stratified set C 
to be a left partial code if no prefix of cr(C') can be expressed in two ways as an 
element of C* . Furthermore, we let Ipcstratification be the special case of the 
leftpartialcoderange problem in which we are only asked to determine whether 
a{h{S)) G h{S*), i.e. Ipcstratification is the language of all morphisms h : 
E* — !■ A* such that h{E) is a left partial code and the stratification of h{E) is 
in h{E*). Then 

Theorem 10. Ipcstratification is RUSPACE{logn)-complete. 

Proof sketch. The construction used in Theorems 8 and 9 in effect proves that 
Ipcstratification and Lm are many-one logspace equivalent. Here Lm is the 
language of graphs having a path from source to target, but required to possess 
the unique path property only from the source to any accessible node. This is 
precisely the property which corresponds to the action of canonically expressing 
the stratification of a left partial code. We then appeal to the main result of [12], 
namely that Lm is RUSPACE{logn)-complete. 

4.3 Inversion 

The class (Functional-L)-^^ can be defined equivalently as the set of functions 
computed by single valued fVL-transducers. We have: 

Theorem 11. Problem inv is in (Functional-L)^ ^ . 

Proof. To solve inv, we use the algorithm and the automaton constructed in 
Theorem 4. Here, instead of first producing w, we start the deterministic logspace 
simulation of the automaton. Whenever the simulation would require reading a 
letter from w, we appeal to an NL-oracle to test whether the current input 
position breaks w up into a prefix and a suffix in h{E*) (an NL-oracle can test 
this by Theorem 6). This concludes the proof. 

We remark that the deterministic version of inv, namely the restriction of 
inv in which is a prefix code, belongs to Functional-L. 

Since computing the inv function allows answering the range question, we 
have NL C and by the previous theorem NL = This implies that 

FL^^ = FL^'^'^, i.e. that the inv function is Turing-complete for FL^^. 
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Abstract. In the literature, there are quite a few sequential and par- 
allel algorithms to solve problems in a distance-hereditary graph G uti- 
lizing techniques discovered from the properties of the problems. Based 
on structural properties of G, we first sketch characteristics of prob- 
lems which can be systematic solved on G and then define a general 
problem-solving paradigm. Given a decomposition tree representation 
of G, we propose a nnified approach to construct sequential dynamic- 
programming algorithms for several fundamental graph-theoretical prob- 
lems that fit into our paradigm. We also show that our sequential solu- 
tions can be efficiently parallelized using the tree contraction technique. 



1 Introduction 

A graph is distance-hereditary [2,11] if the distance stays the same between any 
of two vertices in every connected induced subgraph containing both (where 
the distance between two vertices is the length of a shortest path connecting 
them). Distance-hereditary graphs form a subclass of perfect graphs [7,10,11] 
that are graphs G in which the maximum clique size equals the chromatic num- 
ber for every induced subgraph of G [9]. Two well-known classes of graphs, trees 
and cographs, both belong to distance-hereditary graphs. Properties of distance- 
hereditary graphs are studied by many researchers [2,3,4,5,6,7,8,10,11,12,13] 
which resulted in sequential or parallel algorithms to solve quite a few inter- 
esting graph-theoretical problems on this special class of graphs. 

Known polynomial time algorithms on distance-hereditary graphs utilize 
techniques discovered from the properties of the problems. In this paper, we pro- 
pose a different approach. Given a graph problem, we say it belongs to the class 
of subgraph optimization problem if the object of this problem aims at finding a 
subgraph of the input graph to satisfy the given properties which includes an op- 
timization constraint. For example, the problem of finding a maximum indepen- 
dent set is a subgraph optimization problem. By discovering fundamental proper- 
ties of distance-hereditary graphs, we define a general problem-solving paradigm 
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for subgraph optimization problems. Problems fit into our paradigm include 
(a) the maximum independent set problem, (b) the maximum clique problem, 
(c) the Steiner tree problem, (d) the vertex connectivity problem, (e) the edge 
connectivity problem, (f) the dominating set problem, (g) the independent dom- 
inating set problem and (h) the connected dominating set problem. We develop 
a systematic approach to construct dynamic programming algorithms. For the 
above problems, our constructed algorithm run in 0(n+m) time, where n and m 
are the number of vertices and edges, respectively. In addition to the sequen- 
tial solutions, we also show that our general problem-solving paradigm can be 
efficiently parallelized if a binary tree representation of a distance-hereditary 
graph is given. The parallel complexities of problems (a)-(h) are 0(log^ n) time 
using 0 {n + m) processors on a CREW PRAM. If a binary tree representation 
is given, all but the problem (c) can be solved optimally in O(logn) time using 
0 {n/ logn) processors on an EREW PRAM. 

2 Preliminaries 

This paper considers finite, simple, undirected graphs G = {V,E), where V 
and E are the vertex and edge sets of G, respectively. Let n = \V\ and 
m = \E\. For standard graph-theoretic terminologies and notations, see [9]. 
Let G[X] denote the subgraph of G induced by A C V. The union of two 
graphs Gi = (Vi,Ei) and G2 = {V2,E2) is the graph G = (Pi U V2,Ei U E2). 
The class of distance-hereditary graphs can be defined by the following recursive 
definition. 

Definition 1. [4] (1) A graph consisting of a single vertex u is a primitive 
distance-hereditary graph with the twin set {?;}. (2) If Gi and G2 are distance- 
hereditary graphs with the twin sets Si and S2, respectively, then the union 
of Gi and G2 is a distance-hereditary graph G with the twin set SiU 82- In this 
case, we say G is formed from Gi and G2 by the false twin operation ©. (3) If Gi 
and G2 are distance-hereditary graphs with the twin sets Si and S2, respectively, 
then the graph G obtained from Gi and G2 by connecting every vertex of Si 
to all vertices of S2 is a distance-hereditary graph with the twin set SiU 82- In 
this case, we say G is formed from Gi and G2 by the true twin operation ©. 
(4) If Gi and G2 are distance-hereditary graphs with the twin sets Si and S'2, 
respectively, the graph G obtained from Gi and G2 by connecting every vertex 
of Si to all vertices of S2 is a distance-hereditary graph with the twin set Si . In 
this case, we say G is formed from Gi and G2 by the attachment operation 0. 

The above definition implies that a distance-hereditary graph can be repre- 
sented by a binary tree form, called a decomposition tree. The complexities of 
constructing a decomposition tree are shown below. 

Lemma 1. [ 4 , 14 ] ^ decomposition tree of a distance-hereditary graph can be 
constructed in sequential 0{n + m) time, and in parallel 0(log^ n) time using 
0 {n + to) processors on a CREW PRAM. 
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3 A General Problem-Solving Paradigm 

Throughout this paper, we assume each distance-hereditary graph under consid- 
eration is represented by its decomposition tree. For a vertex x in a decomposi- 
tion tree of G, let T{x) be the subtree rooted at x and let Gx be the subgraph 
of G induced by the leaves of T{x). Also let Sx be the twin set of Gx and 
let Vx = V{Gx)- Let H he a, subgraph of G and X be a subset of V. 

Definition 2. Define 'Pk (G, H) to be the problem of finding a subgraph Q in H 
such that the following two properties hold: (1) the graph G[V \ V{Q)] satisfies 
the given property V, (2) there exists a component C in G\V \ V{Q)] satisfying 
V{C)f]X = 0. For convenience, we use Vx{G) to represent Vx{G, H) li H = G, 
and let Px(G) denote a solution of Vx{G). The superscript 1 in the notations 
used represent the problem under consideration as type 1. 

Definition 3. Define Vx(G,H) (respectively, V^{G, H)) to be the problem of 
finding a subgraph Q in H such that the following two conditions hold: (1) Q sat- 
isfies the given property V, (2) there exists Q' QQ such that for each connected 
component G,, {y{Q')(^V{Gi))CiX ^ 0 (respectively, {y{Q')(^V{Gi))CiX = 0) 
if V{Q') n V{Ci) yf 0 and A 0. For convenience, we use 'P\{G) (respec- 
tively, 7^A(G)) to represent Vx{G,H) (Vj^(G, H)) if H = G, and let Px{G) 
(respectively, PfL(^G)) denote a solution of Vx(G) (respectively, piL{G)). The 
superscript 2 in the notations used represent the problem under consideration 
as type 2. 

Let A{G) and Bi{Gi), 1 < i < fc, be graph problems with the input graphs G 

and Gi, respectively. A reduction scheme from Bi's to A, denoted by A(G) < — ^ 
(i?i(Gi), i? 2 (G 2 ), . . . , Bk{Gk)) means that a solution of A(G) can be obtained by 
merging solutions of Bi{Gi)’s in 0{f{n, k)) time, where n = |y(G)|. We also say 
that A(G) is reduced to Bi{Gi), B 2 {G 2 ), ■ ■ ■ , Bk{Gk)- We say the above scheme 
a polynomial (respectively, constant) time reduction scheme if 0(/(n, k)) = 
0{n‘), I is a constant (respectively, 0{f{n,k)) = 0(1)). For convenience, the 
superscript f{n, k) in the notation is omitted if /(n, k) = 0(1). We also use f{n) 
to denote /(n, k) if A: is a constant. In the following sections, G is a distance- 
hereditary graph formed from Gi and G 2 with the twin sets Si and S 2 , respec- 
tively. Let S be the twin set of G. 

3.1 Perfectly Dividable Problem and Dividable Problem 

Definition 4. Problem P^{G), i G {1,2}, is perfectly dividable on G if the fol- 
lowing two conditions hold. (1) Given X G (0, S}, there exists Y G (0, Si} 

and Z € { 0 ,^ 2 } such that P^(G) (P{.(Gi), P^(G 2 ), P^(G, G[^i U ^ 2 ])). 

(2) V^{G,G[Si U S' 2 ]) can be solved in 0(n'^) time, where g is a constant. 

Given a decomposition tree, we have the following result based on the dy- 
namic programming technique. 
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Theorem 1. Let V^{G), i € {1,2}, be perfeetly dividable. V^iG) ean be solved 
in 0{n ■ {f{n) + n^)) time. Moreover, if f{n) = 0(1) and V^{G, G[S'i U S' 2 ]) can 
be solved in 0(1) time, then V^{G) ean be solved in 0{n) time. 

Definition 5. Problem V^{G), i € (1, 2}, is said to be dividable if the following 
condition holds. Given X € (0, S, S'} and Y € (G, G[P\S]}, there exist k = 0(1) 
problems B/s in {7^^(Gi), (Gi), 7^|-(Gi), 7^^(G[Pi \ S,]),Vh,{G[V, \ Si])}, 
and k problems G^-’s in {V^^iG 2 ),Vs^{G 2 ),Vi^{G 2 ),V^{G[V 2 \ S 2 ]), 
7 ^g^(G[V 2 \ S 2 ]) such that there is a reduction scheme from Bj’s and Cj’s to 

vi{Y). 

Theorem 2. Let V^{G), i G {1,2}, be dividable. V^{G) can be solved in 
0{f{n) ■ n) time. Moreover, if f{n) = 0(1), then V^{G) can be solved in 0(n) 
time. 



3.2 Parallel Implementation of Paradigm 

In this section, we define two systems on a given decomposition tree, called closed 
system A and closed system B, which can be utilized to parallelize the dividable 
and perfectly dividable problems using the binary tree contraction technique 
described in [1]. The binary tree contraction technique recursively applies two 
operations, prune and bypass, to a given binary tree. Prune{u) is an operation 
which removes a leaf node u from the current tree, and bypass{v) is an operation 
(following a prune operation) that removes a node v with exactly one child w 
and then lets the parent of v become the new parent of w. 

We define four operators working on graphs. Given a set of subgraphs H = 
{Hi, H 2 , . . . , Hk} of G, where k > 2, the operator Min„ (respectively, MiNg) 
is applied on H which returns a subgraph in H with the minimum number of 
the vertices (respectively, edges). The operators Max„ and MAXg are defined 
similarly. 

Definition 6. Let Pg(G), i G {1,2}, be a dividable problem and Dq be a 
decomposition tree. The closed system A on Dq is defined as follows. Initially, 
each leaf I of Dc (representing a primitive distance-hereditary graph Gi) is 
associated with five solutions of V^{Gi), \ *5'/]), and 

Vg^ {G[Vi \ S'/]). Assume is a given operator in { MiN„, MiNe, Max„, Max^ }. 
For each internal node v with the left and right children u and w, respectively, 
the solutions of 1P^(G„), V^g^ (G„), V^{G,), 1P^(G[K \ S„]), and (G[K \ S„]) 
can be obtained by the following rule: 



Pi{G„) = <L{Cli U CZi, 
PhAG„)=L>{CliUG^^i, 
P;iG[V„ \ S.]) = <P{Cli U G 3 - 1 , 
P^s^iG[VAS.])=<P{CliUCli, 
P^{G„) = d^iGh U Gs^i 



C U I I /^U 

12 U Oi 2 , • • • , 

I t /^U 

^2,2 ^ ^2,2’ • ■ • 5 

C U I I 

3,2 ^ ^3,3’ • • • 5 ^3,r 

C U I . /^W /^U 

4,2 ^ ^4,2? • • • ? ^4,s 

C U I I f^U 

5,2 ^ ^5, 25 • • • 5 ^5,i 



uG};^, 

UG^^J, 

UG 3 -J, 

UG4™J, 

UG 5 -}, 



( 1 ) 

(2) 

( 3 ) 

( 4 ) 

( 5 ) 



Characterization of EfHcientiy Soivabie Probiems 261 



where p, q, r, s and t are all constants and S {P^iGu), Ps^ (G„), 

P;{G[Vu \ Su]),P^s^G[Vu \ 5„])} and G™, € {^eMG^), P^JG^), P^(g1), 
P^{G[Vw \ Sw]),Ps^ (G[T4) \ 5'uj])}. Let 7 be the root of Dq- The goal is to find 
a solution oiVl{G^) ='P^{G). 

During the process of tree contraction, we construct the following five func- 
tions associated with each node v G V{Dc)- Let Xi,X 2 ,X 3 ,X 4 and X 5 be inde- 
terminates that stand for the possibly unknown solutions of 'Pl{Gv),'Pg (G„), 
V^(G[Vy \ 5^]), Ps„(G[i4 \ 5^]) and V^{Gy), respectively. The functions associ- 
ated with V possess the following form: For 1 < j < 5 , X2, X3, X4, X5) = 

^ U Oaj 1 , Xa^ 2 U 2 , . . . , Xa-^^. U }, where 1 < apk < 5 and ^ is 
a subgraph of Gy. We call the above functions goal functions and their common 
form as closed A form. Let v' be the child of par(v) which is an ancestor of v 
or V itself in the original Dq. We let the functions Ff and F^ to 

represent P^{Gy>),P^g^^{Gy>), P^(G[Vy> \5'^']), Ps^,(G[Vy> \ 5'^']) and 
respectively. 

During the process of executing the tree contraction, some nodes are removed 
and the schemes of the remaining nodes are adjusted such that the functions re- 
main closed A form and the following invariant is maintained. 



Invariant (I): Let v be an internal node of the current tree such that v holds <P 
and U operators which generate the solutions of V^{Gy) ,Vg^{Gy) ,V^{G[Vy\Sy]) , 
Pg^(G[i 4 \S'„]) and V^ifGy) based on Equations (l)-( 5 ). Let u (respectively, w) 
be the left (respectively, right) child of v in the current tree whose associated 
functions are P“(Ai, A2, A3, A4, A5) (respectively, F“(Ai, A2, A3, A4, A5)) for 
all 1 < f < 5 . For each Ch G {P0*(G„), P^JG„), P0*(G[K \ 5 „]),P^JG[K \ 
Su]),P§^iGu)} (respectively, Cf i G {P^{Gyy), P^s^{Gy,), P},{G[Vy, \ 5 ^]), 
P5^(G[T4;\<S'u,]), P^(Giu)}) in Equations (l)-( 5 ), we replace it with F^;(Ai, . . . , 



A5) G{Fi“(Ai,...,A 5),P2“(^1,---,^5),P3“(^1,---,^5),P4“(^1,---,^5 ), 
F5“(Ai,...,A 5)} (respectively, F,l",(Ai, . . . , A5) G {Ff (Ai, . . . , A5), 
F^ (Ai , . . . , As) , F3- (Ai , . . . , As) , F4- (Ai , . . . , A5) , (Ai , . . . , A5)}) that rep- 

resent G^; (respectively, G™j), where u' (respectively, w') is the child of v in the 
original Dq which is an ancestor of u (respectively, w) or u (respectively, w) it- 
self. For brevity, we use F"^, (respectively, Fj"^,) to denote F“j,(Fg(G„), Pg (Gy), 
P^{G[Vy\Sy]), 

P^s^ (G[K \ 5„]), Fi-(G„)) (respectively, F^^{P;{Gy,), F^^ (G^), P;{G[V^ \ 5^]), 
Pg^ (G[T4;\<S'u,]), P^(Gy,))) when there is no confusion. Then, P^{Gy) = <d>{Ff f^ 

PiTi, Pi“2 U Fi™ 2, . . . >i“p u Fi™p}; F^^ (G„) = <!>{Fl^ U F^^^, . . . , F^^ U 

P2“J; P0(G[K \ F„]) = u F3“i,Fg“2 U F3-2, . . . ,F3“, U F3™,}; F^JG[K \ 

5 .]) =^{P4"iUF4“ ,FK2UF4- ,...,F4“,UF4™ }; F^(G„) = <Z>{F 5“3UF5™ ,F5«2U 



Fs“2,...,Fs“*UFs“}. 

Let G 0 = (0,0). Initially, we define the functions for a node v G Dq to be 
(i) Fi"(Ai,A 2 ,A 3 ,A 4 ,As) =<!>{AiUai}, 
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Lemma 2. During the process of executing the binary tree contraction to remove 
some nodes, the goal functions of the remaining nodes can be adjusted in 0(1) 
time using one processor such that the Invariant (I) is maintained. 

Proof. The proof is by induction on the number of contraction phases. The 
Invariant (I) holds trivially on the original Dq along with the initial functions 
defined on V{Da). We use the tree contraction algorithm to reduce the given 
input tree Dq into a three-node tree T' such that solving closed system A on Dq 
is equivalent to solving closed system A on T' . We augment the prune and 
bypass operations by constructing corresponding functions so as to maintain the 
invariant (I). 

In the execution of the tree contraction, assume prune(u) and bypass{par(u)) 
are performed consecutively. Assume u is the left child of v (the case of u being 
the right child can be similarly computed). Let par(u) = v and sibiu) = w. Note 
that u and v are removed; hence, their contributions to the five target solutions 
computed at node par{v) have to be incorporated into the functions associated 
with w. Assume, without loss of generality, that v contains the operators ^ 
and U which perform Equations (l)-(5). The five target solutions associated 
with node v is given by 



{Qp U Qfci), ATc2 U {Qp Uac.^),...,Xct U {Qp U Oc*)}, } (distributive law) = 



a'c2, • ■ • , ATcj U a'ct} (associative law) = d>{Xdi U ((Ujja'a^. | aj = di}) U 



dk}) U bj = dfe}) . . . , (Ujja'cj I cj = dfe}))} (communicative and 

distributive laws) = d>{Xd^ U fIdi,Xd^ U (3d2, ■ ■ ■ ^ ^dk U /?dfc} (associative 
law), where Xi, 1 < i < 5, are unknown target solutions of node w, as- 
suming that the invariant (I) holds before the prune(u) and bypass(v) op- 
eration. The following four terms can be simplified similarly. Pg^iGy) = 





^{Xa^\Ja'a^,Xa^ U 

...,Xy,VJa'c„Xy^\J 



X-l)2 II O- 1 ' ' ‘ ^^bs II 
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Hence, the contribution of five target solutions to the node w is given by 

(i) Ff(Xi,X2,X3,X4,X5) = 

Sv]),P^{Gv)) = ^{Xhi U ahi , Xh^ U 0^2 , . . . , Xh^ U } (the process of simpli- 
fying is similar to simplify the function corresponding to P^{Gy) describe in last 
paragraph). The following functions can be further simplified as closed A form. 

(ii) F^{X,,...,X 5 ) = F^. (hi) F^(X,,...,X 5 ) = F^. (iv) Ff(Ai,...,A 5 ) = 

F 4 . (v) F^{Xi , . . . , X 5 ) = Fg . Therefore, the invariant (I) is then maintained. 
By the definition of the closed A form and the composition of the goal function, 
the complexities can be done. □ 



According to Lemma 2, the closed system A can be implemented in 0(log n) 
time using 0(n/ log n) processors on an EREW PRAM. Hence, we have the 
following theorem. 

Theorem 3. Let V^iG), i G {1,2}, be a dividahle problem and let <P be a given 
operator in { MiN^,, MiNe, Max„, MAXg }. V^{G) can be solved in O(logn) 
time using 0(n/logn) processors on an EREW PRAM if the following condi- 
tion holds. Given X G {0,5, 5} and Y G {G,G[V \ 5]}, there exist two equal- 
cardinality sets B whose each element is in {Pg(Gi), Pg_^ (Gi), P-^(Gi), P^{G[Vi\ 
5i]), Pg_^ (G[Vi\5i])| and G whose each element is in {P 0 (G 2 ), Pg^ (G 2 ), P^{G 2 ), 
PJ(G[P2\52]),P^^(G[E2\52])} such thatP^^iY) = ^{B,UCk\ B, G R;Gfe G Gj. 

A problem P^{G) is said to be type i dividable problem with the closed A 
reduction scheme if it satisfies the condition of Theorem 3. 

Definition 7. Let ^^(G), i G {1,2}, be a dividable problem and Dq be a 
decomposition tree. The closed system B on Dq is defined as follows. Initially, 
each leaf I of Dq is associated with P^{Gi) and Pg^ (G;), and each internal node v 
with the left and right children u and w is associated with P^{Gy,G[Su U 5^;]). 
Assume ^ is a given operator in { MiNu, Max„, MiNg, MAXg }. The solutions 
of Fg(G„), Pg^{Gy) can be obtained by the following rule: 

P;{Gy) = <?{Gr,Gr,P0(G,,G[5„ U 5^])}, (6) 

P^g^ {Gy) = <P{G^, G^, P^{Gy,G[Sy U 5„])}, (7) 

where G“ G {Pi^iGu), Pg^{Gu)} and Cjf G {Pi^iGy,), Pg^{Gyj)}. Let 7 be the 
root of Do- The goal is to find a solution of V^{G-f) = V^{G). 

With the technique similar to implement the closed system A, we have the 
following result. 

Theorem 4. Let V^{G), i G {1,2}, be a perfectly dividable problem and be a 
given operator in { Min„, MiNg, Max„, MAXe}. Problem P^{G) can be solved in 
0(log^ n) time using 0{nj log n) processors on an EREW PRAM if the following 
two conditions hold. (1) Por all nodes v G V{Dg) with the left child and the right 
child u and w, respectively, V^{Gy, G[5„ U 5u,]) can be solved in 0(log^ n) time, 
k>\, using the tree contraction technique. (2) Given X G {0,5}, Y G {0,5i} 
and Z G {0, 52 }, Pi(G) = <P{Pf (Gi), P^(G 2 ), FgHG, G[5i U 52 ])}, 
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A problem V^{G) is said to be a type i perfectly dividable problem with the 
closed B reduction scheme if it satisfies the condition of Theorem 4. 



4 Complexities of Perfectly Dividable and Dividable 
Problems 

We first consider the vertex connectivity problem. A vertex (respectively, edge) 
separator of a graph is a minimal set of vertices (respectively, edges) whose 
removal make the given graph be disconnected or result in a trivial graph. A 
minimum vertex (respectively, edge) separator of G is a vertex (respectively, 
edge) separator with the minimum cardinality. We define the vertex (respectively, 
edge) connectivity problem VC(h{G) (respectively, £Cii,{G)) to be the problem that 
finds a minimum vertex (respectively, edge) separator Q of G. For a distance- 
hereditary graph G with the twin set S', a minimal vertex (respectively, edge) 
separator Q is called S-type if there exists a component H of G[F \ Q] such 
that V{H) n S = 0. VCs{G) (respectively, £Cs{G)) is the problem that finds a 
minimum S-type vertex (respectively, edge) separator of G. 

In the remainder of this section, assume G = (V, E) is a distance-hereditary 
graph formed from Gi and G2 with the twin sets Si and S2, respectively. Due 
to the space limitation, the proof of the following lemma is omitted. 

Lemma 3. Given a decomposition tree Dq of G, VC 0 (G, G[Si U S2]) can be 
solved in 0 {n) sequential time and in parallel O(logn) time using 0(n/logn) 
processors on an EREW PRAM. 

Given an operator in {MiN„, Min^, Max„, MAXg}, we let A{G) 
(6i(Gi),62(G2), . . . ,Sfc(Gfc)) to represent A(G) = ^{Bi\ 1 < i < k} and A(G) 
^ {Bi{Gi),B2{G2),...,Bk{Gk)) to represent A{G) = U{B,\ l<i<k}. 

Theorem 5. VCij){G) is a type 1 perfectly dividable problem with the closed B 
reduction scheme. Given a decomposition tree Dq, it can be solved in linear time 
and in parallel O(logn) time using 0(n/ log n) processors on an EREW PRAM. 

Proof. We first show VC$(G) is type 1 perfectly dividable. Let X G {0,S}. If 

G = Gi 0 G2, then VCx(G) (VCs, (Gi), VCs,(G2), VCb(G, G[Si U S2])). 

If G = Gi © G2, then VCe(G) (VCs,(Gi), VCs,(G2), VC0(G,G[Si U S2])) 

and VCs(G) (VCs, (Gi), VC0(G2), VC0(G, G[Si U S2])). If G = Gi © G2, 

then VCx{G) (VCjc(Gi), VC;f (G2), VC0(G, G[Si U S2])). By Lemma 3, 

VC0(G, G[Si U S2])) can be solved in linear time and in parallel O(logn) time 
using 0 {n/ log n) processors on an EREW PRAM. From Definition 4, Theorem 1 
and Theorem 4 the result holds. □ 



The following theorem can be obtained similar to the above discussion. 
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Theorem 6. £Cq{G) is a type 1 perfectly dividable problem with the closed B 
reduction scheme. Given a decomposition tree Dq, it can be solved in linear time 
and in parallel O(logn) time using 0(n/ log n) processors on an EREW PRAM. 

It can be verified that the maximum clique problem and the Steiner tree 
problems are both type 2 perfectly dividable problems with the closed B reduc- 
tion scheme. Given a decomposition tree, the problems can be solved in linear 
time and in parallel O(logn) time using 0(n/logn) processors on an EREW 
PRAM. 

An independent set of a graph is a subset of its vertices such that no two 
vertices in the subset are adjacent. The independent set problem ’I${G) is the 
problem that aims at finding a maximum cardinality independent set. Let G = 
(V,E) be a distance-hereditary graph and S be the twin set of G. An S-type 
independent set of G is a maximum independent set of G which contains at least 
one vertex of S. An S-type independent set of G is an independent set of G which 
contains no vertex of S. A minimum S-type (respectively, S-type) independent set 
of G, denoted by Is{G) (respectively, /^(G)), is an S-type (respectively, S-type) 
independent set of G with the minimum cardinality. Due to the space limitation, 
the proofs of the following lemma are omitted. 

Lemma 4. Suppose G is a distance-hereditary graph formed from Gi = (Vi, Ei) 
and G2 = (V2,L'2) with the twin sets Si and S2, respectively, by the true twin 
operation and S is the twin set of G. Then, (1) I{G) = MAX„{/g(G), 7s(G)}, 
(2)Is{G) = Max„{/si(Gi)U/0(G[P2\S2]),/0(G[Pi\Si])U/s,(G2)}, (3) I^{G) = 
%(Gi) U %(G2), U) h{G[V \ S]) = /0(G[Ei \ Si]) U /0(G[E2 \ S2]). 

Lemma 5. Suppose G is a distance-hereditary graph formed from G\ and G2 
with the twin sets Si and S2, respectively, by the attachment operation and S 
is the twin set of G. Then, (1) I(h{G) = Max„{/^(G), Js(G)}, (2) Is(G) = 
/s,(Gi)U/0(G[P2\S2]), (3)IsiG) = Max„{/0(Gi)U/s,(G2),%(Gi)U%(G2)}, 
(4) k{G[V \ Sj) = /0(G[Pi \ Sij) U %(G2). 

Since no vertex of Gi is adjacent to any vertex of G2 if G is formed from Gi 
and G2 by the false twin operation, the following lemma can be obtained. 

Lemma 6. Suppose that G is a distance-hereditary graph formed from Gi 
and G2 with the twin sets Si and S2, respectively, by the false twin operation, 
and S is the twin set of G. Then, (1) Iih{G) = MAXy{Is{G), I-g{G)}, (2) Is{G) = 
Max 4 /s,(Gi) U /0(G2), /0(Gi) U IsAG 2)}, (3) %(G) = %(Gi) U %(G2), 
(4) h{G[V \ Sj) = /0(G[Ei \ Sij) U /0(G[P2 \ S2]. 

Theorem 7. T^{G) is a type 2 dividable problem with the closed A reduction 
scheme. Given a decomposition tree Dq, 210(G) can be solved in linear time and 
in parallel O(logn) time using 0(n/ log n) processors an EREW PRAM. 



Proof. By Lemmas 4-6, Definition 5, Tt,{G) is a type 2 dividable problem. By 
Theorems 2 and 3, the complexities follow. □ 
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The domination problem, the connected domination problem, the indepen- 
dent domination problem can be verified as type 2 dividable problems with the 
closed A reduction scheme. Given a decomposition tree Dq, these problems can 
be solved in linear time and in parallel 0(log n) time using 0(n/ log n) processors 
on an EREW PRAM. 
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Abstract. The weighted independent domination problem in trapezoid graphs 
was solved in 0{n^) time [1]; the weighted efficient domination problem in 
trapezoid graphs was solved in 0(n log log n + m) time [8], where m denotes 
the number of edges in the complement of the trapezoid graph. In this paper, 
we show that the minimum weighted independent dominating set and the 
minimum weighted efficient dominating set in trapezoid graphs can both be 
found in 0(n log n) time. Both of the algorithms require only 0{n) space. Since 
m can be as large as comparing to previous results, our algorithms 

clearly give more efficient solutions to the related problems. 



1 Introduction 

The intersection graph of a collection of trapezoids with corner points lying on 
two parallel lines is called the trapezoid graph [5]. The fastest known algorithm 
for recognition of trapezoid graph is given by Ma and Spinrad in [11] in 0{n?) 
time. Throughout the paper, we use n (to) to represent the number of vertices 
(edges) in the given graph. Note that trapezoid graphs are perfect and properly 
contain both interval graphs and permutation graphs. Trapezoid graphs are per- 
fect since they are cocomparability graphs. Dagan et al. [5] show that the channel 
routing problem is equivalent to the coloring problems on trapezoid graphs and 
present an 0{nx) algorithm to solve it where x is the chromatic number of the 
trapezoid graph. Felsner et al. [6] design 0{n log n) time algorithms for chromatic 
number, weighted independent set, clique cover and maximum weighted clique 
for trapezoid graphs. Also, since Steiner [14] show that the Hamiltonian cycle in 
the cocomparability graph can found in polynomial time, the Hamiltonian cycle 
problem of trapezoid graphs is also solvable in polynomial time. 

A dominating set of a graph G = (V,E) is a subset D of V such that every 
vertex not in D is adjacent to at least one vertex in D. Each vertex v G V can 
be associated with a (non negative) real weight, denoted by w{v). The weighted 
domination problem is to find a dominating set, D, such that its weight w{D) = 
'^veD w(v) is minimized. A dominating set D is independent, connected or total 
if the subgraph induced by D has no edge, is connected, or has no isolated 

* Supported in part by the National Science Council, Taiwan, R.O.C, grant NSC-88- 
2213-E-126-005. 
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vertex, respectively. Dominating set problem and its variants (discussed later) 
have many applications in areas like bus routing, communication network, radio 
broadcast, and social network, ..., etc. [7] An independent dominating set, D, 
is efficient (a.k.a independent perfect dominating set) if every vertex of V is 
dominated by exactly one vertex of D. Efficient domination problem has several 
interesting applications in coding theory and the resource allocation of parallel 
processing system [3,8,10]. 

The decision version of the weighted domination problem is NP-complete 
even for cocomparability graphs [4]. For trapezoid graphs, Liang [9] shows that 
the minimum weighted domination and the total domination problem can be 
solved in 0{mn) time. Srinivasan et al. [13] shows that the minimum weighted 
connected domination problem in trapezoid graphs can be solved in 0{m + 
nlogn) time. Arvind et al. [1] showed that the weighted independent domina- 
tion problem in trapezoid graphs can be solved in 0{n^) time. Liang et al. [8] 
showed that the weighted efficient domination problem in trapezoid graphs can 
be solved in 0{nloglogn + m) time, where m denotes the number of edges in 
the complement of the trapezoid graph. 

The paper is organized as follows. Section 2 establishes basic notations and 
some interesting properties of trapezoid graphs. Section 3 gives our 0(n log n)- 
time algorithm of finding the minimum weighted independent dominating set 
in trapezoid graphs. Section 4 extends the algorithm of Section 3 and gives an 
0(n log n)-time algorithm of finding the minimum weighted efficient dominating 
set in trapezoid graphs. Finally, we conclude our results in Section 5. 



2 Basic Notations and Properties 



The trapezoid representation of a trapezoid graph G = (V, E) consists of two 
parallel lines L\ (the lower line) and L 2 (the upper line), such that for each 
element v G V there corresponds a trapezoid using /„ C Li (Jy C L 2 ) as 
the the lower (upper) interval. Counterclockwise rotating the upper line by 90° 
as the y-axis, and using the lower line as the i-axis, we obtain the box repre- 
sentation of a trapezoid graph. Note that each vertex of the trapezoid graph 
corresponds a rectangle box in the two dimensional representation. Figure 1 
shows a trapezoid graph with its corresponding trapezoid and box representa- 
tion. Clearly the trapezoid representation and the box representation are just 
two different interpretations of the same problem. Given a vertex v G V, we use 
tl{v) (tr{v)) and bl{v) {br{v)) to denote the top left (right) and the bottom left 
(right) corner points of its corresponding trapezoid representation. Define two 
points l{v) = {bl{v),tl{v)) and u{v) = (for(u), fr(u)), both in be the lower 
left and upper right corners of the corresponding box of u. It can be shown that 
any trapezoid representation can be transformed into another trapezoid repre- 
sentation with all corner points are distinct while the two representation still 
corresponds to the same trapezoid graph. Thus, we assume that each trapezoid 
of the given problem (with n vertices) has four distinct corners and all corner 
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Fig. 1. A trapezoid graph, its trapezoid representation, and the box represen- 
tation. 



points are distinct with consecutive integer values from 1 to 2n on both parallel 
lines. 

Given a point p G R^, we use Px (py) to denote the x (y) coordinate of p. 
Given two points p,q G R^, p is said to dominated by q, denoted hy p < q, 
if Px < qx and Py < qy. Similarly, given the box representation of a trapezoid 
graph, we define two vertices a < b if u(a) < 1(b); that is, box(a) lies totally 
on the left lower side of box(6) or, equivalently, trapezoid(a) lies totally on the 
left of trapezoid(&). The corresponding partial ordered set is a trapezoid order. It 
can be easily verified that every trapezoid graph G corresponds with a trapezoid 
order P such that two elements are adjacent in G iff they are incomparable in P. 
A subset of vertices U C F of a given poset (F, <) is called a chain if any two 
vertices of U are comparable. A chain U is maximal if no other vertex of V can 
be added into U to form a larger chain. It is easily verified that 

Proposition 1. Given a trapezoid graph, G = (V, E), a subset of vertices D C 
V is an independent dominating set iff D is a maximal chain in the corresponding 
trapezoid order. 

Proof. Any two vertices of D are comparable in the trapezoid order since they 
are independent; further, it is maximal since the other vertices are adjacent to 
some vertices of D. The converse can also be easily shown. □ 

Thus, the problem of finding a minimum weighted independent dominating set in 
a trapezoid graph is equivalent to the problem of finding the minimum weighted 
maximal chain in the trapezoid order. 

Throughout the paper, we use the words “box” and “vertex” interchangingly 
for convenience. Given a vertex v G V , and a subset of vertices S G1 V , define 
Doms(v) = {u G S : u < ?;}; that is, Doms(v) are the boxes (vertices) domi- 
nated by box(x). A box of S is maximal if no other box in S dominates it. We 
use Max(S) to denote the set of maximal boxes in S; further, we abbreviate 
Max(Doms(v)) as MDs(v); also, DOMv(v) and MDv(v) are abbreviated as 
DOM(v) and MD(v). 

The basic idea of our algorithm is the following: we associate with each 
vertex v an aggregated weight, denoted by w'(v). The aggregated weight of v is 
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determined by 

w'{v) = w{v) + min{zi;'(u) : u G MD{v)} 

The vertex u which gives the minimum w'(v) to v is called the (immediate) prede- 
cessor of u. To avoid the complicated boundary condition, we insert two dummy 
vertices (boxes), vq located at the (0,0) and located at 

the (2n -|- l,2n -|- 1) (these two points are the degenerated case of two small 
boxes), each with zero weight. w'{vo) is initialized as zero. After computing all 
the aggregated weights, we have 

Proposition 2. The value ofw'{vn+i) is the minimum weight of the minimum 
weighted independent dominating set. Further, the predecessors of Vn+i are ex- 
actly the minimum weighted independent dominating set of the given trapezoid 
graph. 

Proof. It is easily seen that the predecessors of Vn+i is a maximal chain of the 
trapezoid order. Further, the last box of the minimum weighted maximal chain 
is exactly the (immediate) predecessor of Vn+i, caught while computing w'{v). 
The rest of the proof follows easily by induction. □ 

3 Independent Domination of Trapezoid Graphs 

Immediately following the previous discussion, a simple-minded algorithm can 
be constructed as following: for each vertex v, find the sets Dom{v) and MD{v), 
and compute w'{v) from left to right. If we do it carefully, it will cost 0{n-\-m) 
time to finish. Unfortunately, m can be as large as f2(n^). The essential idea for 
a faster algorithm is that we can not afford to examine every vertex in DOM{v) 
(or even MD{v)) to determine the value of w'{v) for each vertex v. 

The general idea of our algorithm is basically a recursively divide-and- 
conquer scheme. According to the x coordinates of the lower corners of V, we can 
partition the n boxes into two equal-sized subsets L,R C V (|L| = |i?| = n/2) 
such that all lower corners of L lie on the left of any lower corner of R; that is, 
max{Z(r;)a; : v € L} < min{Z(?;) 2 ; : v G R}. First, we recursively solve the problem 
for boxes in L. A propagation step will collect the information computed from 
the previous step and propagate these information into vertices of R. Finally, 
another recursive call for boxes in R (which now aggregated with extra informa- 
tion from L) will reveal the final solution. Note that the propagation step takes 
place before the second recursive call. The goal here is to do the propagation 
step in 0(n), implying that the problem can be solved in O(nlogn) time. The 
Overall scheme described so far is very similiar to [2] for finding the independet 
dominating set of permuation graphs. However, the propagation step we used in 
the trapezoid graphs models involves two-dimensional boxes, which will compli- 
cate the analysis and the algorithm. In contrast to the permutation graph, the 
models in consideration are just single points in the two-dimensional plane. 

Let S' be a subset of boxes in V and let u be a box whose lower corner lies 
on the right of all lower corners of S. Note that some boxes of MD{v) belong 
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to S. Define mps{v) (the minimum weight of the predecessor portion of S) as 
the minimum weight of boxes in S' H MD(v); i.e., 



mps{v) 



min{u>'(a) : a € S n MD{v)} if S H MD{v) ^ 0 
0 otherwise. 



Further, denote the two boxes with the high-most and low-most upper corners 
in Sr\MD{v) by firsts{v) and lasts{v). Note that firsts{v) or lasts{v) might 
not exist; in which case, we use 0 as the function value. 

Note that in the recursive version of our algorithm, the given input will be 
a set of boxes S, sorted by the consecutive x coordinates of their lower cor- 
ners. Let A be boxes of V that lie on the left hand side of S. While computing 
mps(v), the w'{-) values of boxes in A shall have already been determined by 
the algorithm. The recursive algorithm will divide S into two equal-sized sub- 
sets L and R. After return from the first recursive processing for L, the w'{-) 
values of each box of L (together with the minimum predecessor links) are now 
determined. 

The goal of the propagation step is, for each v £ R, to determine the set values 
of first aul{v), lastAuLiv), and mpAuL{v) in O(I'S'I) time during the propagation 
step. Before we go into the details, observe that our dynamic programming 
formula for determining the w'(v), v G L, can now be viewed as the following: 

w'(v) = w(v) -h min ({mpA(v)} U {w'(u) : u € MDl(v)}) 



For the clearness of the presentation, we will mainly discuss how the value of 
the minimum weight problem is calculated; however, it shall be clear that we 
can maintain a predecessor link for each vertex v £ V when the final w'{v) is 
determined. So the corresponding problem for finding the minimum independent 
dominating set can easily be modified in the algorithm. Further, if we choose the 
terminated condition for the recursive algorithm as when [S'! = 1, MDl(v) is 
just an empty set. In which case, w'{v) is simply given by w{v) mpA{v)- Thus, 
if the value of rnpAuL{v) for each box v £ R is determined in the propagation 
step, the same recursive scheme can now be applied to the boxes of R (now 
substitute S' by i? and substitute A by AU L.) Further, for each box v in R, the 
value of mpAuL{v) is determined by the following formula: 

mpAuUv) = min{mpA{v),mpL{v)} 

The situation is illustrated in Figure 2. Also, note that firstAuL{v) can be 
determined by the values of firstA{v) and firstL{v). Likewise, lastAuhiv) can 
be determined by the values of last^iv) and lastA{v). It shall be clear now that 
the essential part of the propagation step is: 

Lemma 1. For each v £ R, values ofmpL^v), as well as first^^v) and last^iv) 
can he determined totally within 0(|S|) time. 



Proof. From bottom to up, we scan each box of R according to the y coordinates 
of the lower corners. During the scanning process, we maintain a stack, AT, 
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Fig. 2. Calculation of mpAuLiv). 



containing a subset of R scanned so far. Boxes of K form a (so far maximal) 
chain in the trapezoid (box) order. That is, if if = (6 ^, . . . , 6i) (from bottom to 
top), then we have bi < bi-i < ■ ■ ■ < bi. Identify boxes of L whose y coordinates 
of the upper corners located between l{b\)y and l{v)y but not include those 
boxes that are not dominated by v. Denote these boxes by By. It is not hard to 
verify that we can find Mq = MaxB^{v) in 0{\By\) time [12]. Let high{Mo) and 
Iow(Mq) be the two boxes with the high-most and the low- most upper corners 
of Mq. If the top of the stack, 6i, is dominated by the currently scanned box v, 
it is clear that Mq is exactly the set MDl{v). Thus we can scan boxes of Mq to 
compute mpL{v) = min{w'(a) : a € Mq}. Note that firsthiv) and lastL(v) are 
just high{Mo) and Iow{Mq). The box v is pushed into the stack when mpL{v), 
firstL{v), and lastL{v) are determined. 

For the case that bi is not dominated by v, we can pop out bi and check 
for the condition of 62. Finally, we shall find a box of K, bj, that is dominated 
by V. (Recall that we can add a dummy box in the bottom of stack to avoid the 
complicated boundary condition.) Note that bj < v, but boxes 61 , ... , are 
not dominated by v. Similar to the definition of Mq, there shall be a sequence of 
subsets of L, namely Mi = MD^ibi), . . . ,Mj_i = MDi(6j_i); they are deter- 
mined while boxes 61 , ... , bj-i being pushed into the stack. The set MDl{v) can 
now be determined by scanning through all vertices of MqU- ■ - UMj-i; it follows 
that mpL^v) = min{zi;'(a) : a € MDl{v)}, and the corresponding firstL{v) and 
lastL{v) can also be determined. Again, we will then push v into the stack. It 
shall be clear that, after the topmost box of R is scanned, we have correctly 
determined the mpL{-), firstL{-), and /asti(-) values for each box of R. 

To justify the whole process takes 0(|S'|) time, first observe that the com- 
putation of Mq = MaxB^{v) for each box v £ R totally spends 0(|i?|) + 
^^g^O(|i?„|) = 0(|i?|) -I- 0(|L|) time. Secondly, every box, in R, that is pop 
out of the stack will not be reexamined again, so the total cost is 0(|i?|). Most 
importantly, since boxes of Mi’s are sorted, from high to low, according to the y 
coordinates of their upper corners, the process of merging all Mi’s into MDl{v) 
takes 0{j) time plus the time needed for eliminating the unwanted boxes (they 
are dominated by some boxes of Mk who lies above them.) The important ob- 
servation is that these boxes can only be eliminated once because they are domi- 
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nated by boxes of some Mk- Since 'Y^O{j) = 0(|i?|), the time needed to maintain 
the stack, the cost of elimination totally takes only 0(|L|)+0(|i?|) time. Thus we 
conclude that the propagation step of the algorithm spends 0{\L\ + \R\) = OdS”!) 
time. □ 

Now we are ready to present our algorithm for finding the minimum independent 
dominating set of trapezoid graphs as shown in Figure 3. 



Algorithm MIDS(5) 

Input: A sequence of contiguous boxes S = (ui, . . . , of V. Note that 

the values of mpA{-) have already been determined where A = (vo , . . . , Vi-i). 
Output: The values of w'{-) for each box in S. 

Step 1: If ^ = 1, w'(vi) = w{vi) + mpA{vi)\ return to the caller. 

Step 2: Split S into two equal-sized subsequence L and R. 

Step 3: Recursively call MIDS(L), which computes values of w'{-) for boxes 
in L. 

Step 4- The propagation step, described in Lemma 1 will modify the values of 
mpAuLi') for each vertex in R accordingly. 

Step 5: Recursively call MIDS(R), which computes values of w'{-) for boxes 
in R. 

End of MIDS 



Fig. 3. finding the minimum independent dominating set in trapezoid graphs. 



Theorem 1. Let vq and Vn+i he two zero-weighted dummy boxes, located at 
the (0,0) and (2n -|- l,2n-|- 1) (these two boxes are two degenerated boxes as 
two points.) The algorithm MIDS({vq U FU {vn+i\) return w'(-) for each vertex 
in V in 0(n log n) time and 0(n) space. Further, w'{vn+i) is the solution for 
the minimum weighted independent dominating set problem. 

Proof. The correctness of the algorithm follows from Observation 2 and 
Lemma 1. Let T{i) denote the running time of the algorithm MIDS(S'). Clearly 
Step 2 requires at most 0{£) time. Further, by Lemma 1, Step 4 requires 0{£) 
time. Thus we have the recurrence T{£) = 2T(£jT) A- 0{£). It follows that 
T{£) = 0{£\og£). Further, space requirement for the algorithm is for the values 
of w' {■), mp{-) , first {■) , and last{-)’s for each box of V. Thus we conclude that 
MIDSduoUF U{u„+i}) correctly determines the minimum weighted independent 
dominating set problem in 0(n log n) time and 0(n) space. □ 



4 Efficient Domination of Trapezoid Graphs 

As mentioned in Section 1, efficient domination problem has several interesting 
applications in coding theory and the resource allocation of parallel processing 
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system. Here we present our algorithm, which is very similiar to the algorithm 
mentioned at the previous section, for finding the weighted efficient dominating 
set of trapezoid graphs in 0(n log n) time. 

By Observation 1, we know the efficient dominating set of a trapezoid graph 
forms a maximal chain in the trapezoid order since an efficient dominating set is 
an independent dominating set. Thus, it is possible that we can refine the MIDS 
algorithm to solve the efficient domination problem in trapezoid graphs. 

Given a box v, consider all boxes. A, whose top left corners lie on the top left 
position of the lower corner of v. Let t(v) be the box whose top left corner is 
the leftmost point among A. Symmetrically, consider all boxes, B, whose bottom 
right corners lie on the bottom right position of the upper corner of v. Let b{v) 
be the box whose top left corner is the rightmost point among B. It is shown 
that t{v) and b{v) can be computed in 0{n) time [8]. Further, we denote the top 
left corner (a point) of t{v) by t'{v), and the bottom right corner of b{v) by b'{v). 
Note that, if u is an immediate predecessor oiv(ud MD{v)), u might not be an 
immediate efficient predecessor of v. That is, there might be a box incomparable 
to both u and v. However, it is easily verify that u is also an efficient predecessor 
of r; if dominates b{u) and, at the same time, t{v) dominates u. Note that both 
of the conditions can be checked in constant time after the t{-),b{-) values are 
decided. 

Note that the only place we need to change in the algorithm MIDS is to mod- 
ify the meaning of MD{v) (immediate predecessors) into MD'{v) (immediate 
efficient predecessors.) That is, in the proof of Lemma 3, when the algorithm try 
to find the set Mg = Maxs^iy), we also check the condition that whether boxes 
of MaxB„{v) are efficient predecessors of v. Only the efficient predecessors in 
MaxB„ are put into Mg. 

The correctness and time complexity of the modification version of the algo- 
rithm, which we call it the MEDS algorithm for the obvious reason, is similar to 
what we have done for the MIDS algorithm, and is skipped here. 

Theorem 2. The modified algorithm MEDS({vq U E U {w„+i } ) return w'(-) for 
each vertex in V in 0(n log n) time and 0{n) space. Further, w'{vn+i) is the 
solution for the minimum weighted efficient dominating set problem. 

5 Concluding Remarks 

In this paper, we show that the minimum weighted independent dominating set 
problem and the minimum weighted efficient dominating set problem in trape- 
zoid graphs can both be efficiently solved in O(nlogn) time. It is interesting 
to note that, since the essential scheme of our algorithm a recursive one, the 
algorithm does not benefit from the fact that the input trapezoid representa- 
tion are given in the consecutive integer values. Is it possible that the minimum 
weighted independent dominating set problem of trapezoid graphs (and per- 
mutation graphs as well) can be solved in time complexity like 0(n log log n)? 
Finally, the author will like to thank Chin-Lung Lu for many valuable discussions 
of the efficient domination problem. 
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Abstract. Drawing a graph symmetrically enables an understanding of 
the entire graph to be built up from that of smaller subgraphs. This 
paper discusses symmetric drawings of planar graphs. More specifically, 
we discuss planar geometric automorphisms, that is, automorphisms of a 
graph G that can be represented as symmetries of a planar drawing of G. 
Finding planar geometric automorphisms is the first and most difficult 
step in constructing planar symmetric drawings of graphs. The problem 
of determining whether a given graph has a nontrivial geometric auto- 
morphism is NP-complete for general graphs. In this paper, we present a 
polynomial time algorithm for finding planar geometric automorphisms 
of graphs. 



1 Introduction 

Graph drawing is constructing a visually-informative drawing of an abstract 
graph in the plane. Symmetry is one of the most important aesthetic criteria 
that represent the structure and properties of a graph visually. Also, a symmetric 
drawing of a graph enables an understanding of the entire graph to be built up 
from that of a smaller subgraph [13,14,15]. 

Symmetric drawings of a graph G are clearly related to the automorphisms 
of G (the relationship is discussed below), and algorithms for constructing sym- 
metric drawings have two steps: 

1. Find the “appropriate” automorphisms, and 

2. draw the graph displaying these automorphisms as symmetries. 

* This research has been supported by KOSEF No. 971-0907-045-1, the Australian 
Research Council, and the SCARE project at the University of Limerick. This paper 
was partially written when the first author was visiting the University of Newcastle, 
and partially written while the first two authors were visiting the University of 
Limerick. 
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Section 2.2 of this paper defines the “geometric automorphisms” of a graph, 
which are the “appropriate” automorphisms mentioned in the first step above. 
Intuitively, an automorphism of a graph G is “geometric” if it can be represented 
as a symmetry of a drawing of G. The concept of geometric automorphism can 
be refined to the concept of “planar geometric automorphism” . We outline an 
algorithm for finding planar geometric automorphisms. The second step, con- 
structing a drawing which displays the planar geometric automorphisms, is not 
discussed in this paper. 

We should note that the problem of finding a planar geometric automorphism 
is related to, but not equivalent to, the problem of finding automorphisms. The 
problem of determining whether a graph has a nontrivial automorphism is equiv- 
alent to the graph isomorphism problem, that is, it is isomorphism complete [16]. 
However, the problem of determining whether a graph has a nontrivial geometric 
automorphism is NP-complete [12,15]; it is probably strictly harder than graph 
isomorphism. Previous work on symmetric graph drawing [13,14,15,6] has fo- 
cused on restricted classes of graphs such as trees [13], outerplanar graphs [14], 
and series-parallel graphs [6]. This paper presents a polynomial time algorithm 
for finding planar geometric automorphisms of connected planar graphs. 

In the next section, we review the background: methods for finding auto- 
morphisms of planar graphs, symmetries of drawings of graphs, and the concept 
of planar geometric automorphisms. Section 3 outlines our algorithm. Section 4 
concludes. 

2 Background 

2.1 Automorphisms 

An isomorphism between two graphs Gi = (Vi,Ei) and G 2 = (V 2 ,T' 2 ) is a one- 
one function a from Vi onto V 2 which preserves adjacency; that is, (u,f) € Ei 
if and only if (a(rt), a(w)) S i? 2 - If Gi = G 2 then a is an automorphism. The set 
of automorphisms of a graph forms a group, and a subgroup of this group is an 
automorphism group. The isomorphism problem is to determine all isomorphisms 
between two given graphs Gi and C? 2 - If Gi = G 2 then the isomorphism problem 
is the automorphism problem. 

Automorphisms of a graph are permutations of the vertices, and some of 
the terminology of permutation groups is helpful [19]. We denote the iden- 
tity permutation by I. The group generated by oi, 02 , ■ ■ • , ctfe is denoted by 
< oi, 02 , ■ • • , CTfc >■ If a permutation a acting on a set V has a fixed element 
V € V, that is, o(ri) = v, then a induces a permutation ay on V — {z;}. 

The algorithm of Hopcroft and Tarjan [7,8] may be used to compute automor- 
phisms of planar graphs. It proceeds by reducing the problem to the triconnected 
case; we adopt a similar approach. The algorithm uses 0(n log n) time. Later, 
Hopcroft and Wong [10] reduced the time complexity to linear, but with a large 
constant. Fontet [4] presents another linear time algorithm. Most of this research 
focuses on the case of planar embeddings. 
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These methods all produce “automorphism partitions” as part of the iso- 
morphism test; essentially the vertex set is partitioned into the orbits of the 
automorphism group. This can be used to assign an integer code{G) for a planar 
graph G such that code{G) = code{G') if and only if G is isomorphic to G' . 
The integer code(G) is called the isomorphism code of G. The automorphism 
partition is useful for drawing graphs symmetrically; intuitively, if Gi and G 2 
are subgraphs of G such that Gi is isomorphic to G 2 , then we can draw Gi 
congruently to G 2 . 

Theorem 1. [10,4] Suppose that G\, G 2 , . . . , Gfc are planar graphs. Then in lin- 
ear time we can compute integers code{Gi),code{G 2 ), ■ ■ ■ ,code{Gk) such that 
code{Gi) = code(Gj) if and only if Gi is isomorphic to Gj. 

The automorphism partition works even if the graphs are labeled (that is, a 
vertex of label x can be mapped only to a vertex of label x) . 

Connectivity plays a large role in these algorithms. The uniqueness of sphere 
embeddings of triconnected graphs restricts the automorphism group consid- 
erably, and makes the problem easier. In general, a planar graph can have an 
exponential number of planar embeddings. The most difficult parts of algorithms 
to find automorphisms are the transformations from the triconnected case to the 
biconnected case, and from the biconnected case to the connected case. 

2.2 Symmetries and Geometric Automorphisms 

It is important to use a rigorous model for the intuitive concept of symmetry 
display. The model below is derived from those introduced by Manning [13,14,15] 
and Lin [11,3]. 

The symmetries of a set of points in the plane (such as a two dimensional 
graph drawing) form a group called the symmetry group of the set. A symmetry 
(T of a drawing Z? of a graph G induces an automorphism of G; the restriction of 
(7 to the points representing vertices of G is an automorphism of G. A drawing D 
of a graph G displays an automorphism a of G if there is symmetry a of D which 
induces a. The symmetry group of a graph drawing induces an automorphism 
group of the graph. An automorphism group P of a graph G is a geometric 
automorphism group if there is a drawing of G which displays every element 
of P. 

To understand the difference between automorphisms and geometric auto- 
morphisms, consider Figure 1 (from [11,3]). Figure 1(a) shows the graph /F 2 , 3 - 
The permutation (123)(45) is an automorphism but not a geometric automor- 
phism; in fact the automorphism group of has size 12, but the only geometric 
automorphism is the one displayed in Figure 1(a), that is, (13)(2)(45). 

A group-theoretic characterization of geometric automorphism groups [11,3] 
follows. A permutation group P is semiregular if each non-identity permuta- 
tion in P does not have a fixed element. A permutation p on 1/ is a rotational 
permutation if either < p > or < p„ > (for some r; € M) is semiregular, and 
j < p > j > 1. Note that a rotational permutation has at most one fixed element. 
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A permutation p on 1/ is an axial permutation if = / and p is a non-identity 
permutation. 

Lemma 1. [11] An automorphism of a graph is geometric if and only if it is 
either an axial permutation or a rotational permutation on the vertex set. 

The ways in which geometric automorphism groups can be combined is not 
as simple as for automorphism groups. It is trivial to see that the union of two 
automorphism groups generates an automorphism group; thus there is a single 
maximal automorphism group. However, a graph can have geometric automor- 
phism groups that are distinct and not combinable; the union of two geometric 
automorphism groups does not necessarily generate a geometric automorphism 
group. This is shown by Figure 1(b) and (c). The graph has a geometric au- 
tomorphism group of size 8 (Figure 1(b)), and a geometric automorphism group 
of size 6 (Figure 1(c)). However, these groups do not combine to make a geo- 
metric automorphism group; each is maximal in that it cannot be made larger. 
In fact. Figure 1(b) displays 8 automorphisms, and the maximum size geometric 
automorphism group of K 4 has size 8. 





(a) 



(b) 




(c) 



Fig. 1. Three drawings with different symmetries. 



Our aim is to find drawings with a maximum number of symmetries, and 
this means finding a maximum size geometric automorphism group. Thus the 
central problem in drawing graphs symmetrically is as follows. 

Geometric Automorphism Problem(GAP) 

Input: A graph G. 

Output: A maximum size geometric automorphism group of G. 

It has been shown [12,15] that GAP is NP-hard. 

2.3 Planar Geometric Automorphisms 

The concepts above can be extended to planar drawings: a geometric automor- 
phism a of a graph G is planar if there is a planar drawing of G which displays 
a, and an automorphism group P is a planar geometric automorphism group if 
there is a planar drawing which displays P. 
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Not every geometric automorphism is planar. For example, the rotational au- 
tomorphism displayed in Figure 1(b) is not planar; the largest planar geometric 
automorphism group of 7^4, displayed in Figure 1, has size 6. 

The central problem of this paper is to find a planar geometric subgroup of 
maximum size. 

Planar Geometric Automorphism Prohlem(PGAP) 

Input: A planar graph G. 

Output: A planar geometric subgroup P of the automorphism group of G, 
such that P has maximum size. 

Previous research on PGAP has concentrated on subclasses of the class of 
planar graphs. An algorithm of Manning [15] finds planar geometric automor- 
phisms of a fixed planar embedding (with a fixed outer face) in linear time. For 
a triconnected graph there are only 0(n) planar embeddings; by selecting each 
outer face in turn, one can solve PGAP for triconnected graphs in quadratic 
time. It has been solved for trees [13,15] and series-parallel graphs [6]. In this 
paper we investigate PGAP for planar graphs. The difficulties are mainly in the 
transformations between connectivity classes. 

3 Finding Planar Geometric Automorphisms 

In this section we outline tools for finding planar geometric automorphisms in 
planar graphs. Many details are omitted for this extended abstract; a more 
complete description appears in [5]. 

We use connectivity to divide the problem into cases. 

1. G is triconnected. 

2. G is biconnected. 

3. G is one-connected. 

4. G is not connected. 

For each case, our algorithm takes for input a graph with vertices and edges 
possibly labeled, and returns a planar geometric automorphism group for the 
graph, as well as (possibly) some labels for the vertices and edges. Each case 
relies on the result of the previous case. 

The first case can be dealt with by the algorithm of Manning [15]. The second 
case is the most difficult, and it is described in the Section 3.2. The third case 
is described in Section 3.1. The fourth case is omited in this extended abstract. 

3.1 The One-Connected Case 

The block- cutvertex tree of a connected graph G [2] has a B-node for each block 
(biconnected component) of G, and a G-node for each cutvertex of G. There 
is an edge between a G-node u and a S-node b if and only if u belongs to the 
corresponding block of b in G. The block-cutvertex tree can be constructed in 
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linear time [17]. A center of a tree is a vertex whose maximum distance from a 
leaf is minimized. Note that in a block-cutvertex tree, there is only one center 
(since every leaf is a i?-node). The center of the block-cutvertex tree may be a 
block or a cutvertex. 

To find geometric automorphisms of a one-connected graph G we need to use 
the method for biconnected (described in Section 3.2) graphs as a subroutine. 

The method proceeds from the leaves of the block-cutvertex tree to the center; 
we may regard the block-cutvertex tree as rooted at the center. At each stage, 
all the blocks corresponding to leaves of the block-cutvertex tree are processed. 
Each cutvertex u of the input graph has three labels. Note that a cutvertex 
occurs in more than one block and in general it has different labels in different 
blocks. The three labels are: 

— A boolean label rtp, which has the value true when u is the cutvertex that 
joins the block B to its parent in T. 

— An isomorphism code Uq. This is a set of integers; the number of elements 
in Uq is the number of children of rt in T. If m and v are two cut vertices 
with Uq = Vq then the subgraph of G graph represented by the subtree un- 
der u in T is isomorphic to the subgraph represented by the subtree under v. 

— A label u®, which indicates whether the subgraph represented by the subtree 
under u has an axial planar automorphism which fixes the cutvertex that 
joins B to its parent. If such an automorphism exists, then this label also 
indicates the maximum number of edges incident with u which are fixed by 
the automorphism. 

The algorithm begins with the labeling phase, as follows. 

1. Construct the block-cutvertex tree T, and for each cutvertex u and each 
block B, compute Up. 

2. For each cutvertex u and each block B, initialize Uq = = null. 

3. Repeat until T has one node (the center): 

(a) Compute the labeled automorphism partition of the set S of blocks of G 
corresponding to leaves of T. Use the automorphism partition to compute 
Uq for each cutvertex u in each block B G S. 

(b) Use the algorithm for biconnected planar graphs (in the next section) to 
test whether each B G S has an axial planar geometric automorphism 
which fixes the cutvertex in B. This gives u^. 

(c) Remove all the leaves of T from T. 

After the labeling phase, we process the center of the tree T. If the center is 
a block, then we call the method for (labeled) biconnected graphs in the next 
section. If it is a cutvertex, then a simpler algorithm applies; details are omitted 
for this abstract. 

3.2 The Biconnected Case 

If the input graph is biconnected, then we break it into triconnected components 
in a way that is suitable for the task. The method is similar to but more complex 
than the method of the previous section. 
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First we review the definition of 3-blocks or triconnected components of a 
biconnected graph. If G is triconnected, then G itself is the unique triconnected 
component of G. Otherwise, let tt, u be a separation pair of G. We split the 
edges of G into two disjoint subsets Ei and E 2 , such that |ifi| > 1, |if 2 | > 1, 
and the subgraphs Gi and G 2 induced by E\ and E 2 only have vertices u and v in 
common. Form the graph G{ by adding an edge (called a virtual edge) between u 
and v; similarly form G 2 . We continue the splitting process recursively on G[ 
and G '2 ■ The process stops when each resulting graph reaches one of three forms: 
a triconnected simple graph, a set of three multiple edges (triple bond), or a cycle 
of length three (triangle). The triconnected components of G are obtained from 
these resulting graphs by merging the triple bonds into maximal sets of multiple 
edges (bonds), and the triangles into maximal simple cycles (polygons). The 
triconnected components of G are unique. See [9] for further details. 

We can define the 3-block tree as follows^. The vertices of the 3-block tree are 
the triconnected components of G. The edges of the 3-block tree are the virtual 
edges, that is, if two triconnected components have a virtual edge in common, 
then the vertices that represent the two triconnected components in the 3-block 
tree are joined by an edge that represents the virtual edge. Note that the tree 
has only one center; the rooted tree rooted at the center is unique [1]. 

The 3-block tree may be constructed in linear time by adjusting the algorithm 
of Hopcroft and Tarjan [9]. An example of a biconnected graph and its 3-block 
tree (from [9]) is in Figure 2. 




Fig. 2. Example of 3-Bloek Tree. 



^ The SPQR-tree defined by di Battista and Tamassia [1] is closely related. There are 
many variants of the 3-block tree in the literature; the first was defined by Tutte [18]. 
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We use a kind of “reduction” [8] , similar to the labeling process for the block- 
cutvertex tree defined in the previous section. The operation takes a 3-block tree 
of a biconnected graph G and deletes the triconnected components represented 
by the leaves. It then adds and labels new edges between the separation pairs^. 
We repeat this process until G becomes a single triconnected component (the 
center of the 3-block tree). The process is called tree- shrinking by Wong [20]. 

As the triconnected component A is deleted from G, an isomorphism code 
is attached to the new edge e joining its separation pair in the remaining part 
of the graph G. In fact, the method assigns a pair of isomorphism codes to 
each triconnected component. This is because the triconnected component has 
an orientation with respect to its separation pair. Suppose that A and B are 
two triconnected components which share the separation pair u, v; suppose that, 
in A, (u,u) has the code {xa^Va) and, in B, (u,v) has the code (xb^Vb)- The 
relationship between A and B is encoded in these pairs as follows. 

— If = xb and yA = Vb then there is an automorphism oi A\J B which 
fixes u and v and swaps the other vertices of A with the other vertices of B. 
(See Figure 3(a).) 

— If xa = Vb and yA = xb then there is an automorphism oi A B which 
swaps u and v and swaps the other vertices of A with the other vertices of B. 
(See Figure 3(b).) 

— If xa = yA then A has an axial automorphism which swaps u and v. (See 
Figure 3(c).) 

This pair can be computed using the algorithm of Hopcroft and Tarjan for 
automorphism partition of triconnected components [8] . 




(a) 




Fig. 3. Examples for isomorphism eodes. 



In addition to the pair of isomorphism codes, we attach further informa- 
tion about the axial automorphisms of a triconnected component. We add this 
information to the new edge. 

To find geometric automorphisms in a planar graph, we need a method to test 
axial automorphisms about a separation pair^. This is illustrated in Figure 4: 

^ Note that multiple edges may occur. 

® This is related to Manning’s A-class test. 
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if B has an axial automorphism, then there are two axial automorphisms of the 
whole graph. However, if it does not, then the number of axial automorphisms 
is one. Details of the computation are omitted. 



o 

: V 



Fig. 4. Example for testing axial automorphisms. 



3.3 Time Complexity 

If G is triconnected then as mentioned previously. Manning’s linear time algo- 
rithm for finding geometric automorphism in the embedded planar graph [15] 
can be used for each of the 0{n) planar embeddings. Thus it takes 0{n^) time. 

When G is composed of one biconnected component, the most expensive 
operation is finding axial automorphisms of each of the triconnected components. 
Each test takes time which is quadratic in the size of the triconnected component. 
Thus in total we use O(n^) time. A similar argument holds for the one-connected 
case. Thus our algorithm has 0{n^) time complexity, where n is the number of 
vertices. 



4 Conclusion 

In this paper we present an algorithm for finding geometric automorphisms in 
a planar graph. This algorithm shows that the PGAP problem can be solved in 
polynomial time, in contrast to the more general GAP problem. 

The relatively high complexity O(n^) is mainly caused by the need to de- 
termine geometric automorphisms of triconnected graphs; we just naively adopt 
Manning’s algorithm for each planar embedding, resulting in 0(n^). The com- 
plexity of the whole algorithm could be reduced if the dependence on the choice 
of outer face were not so critical. 

Further, we would like to turn our attention to the drawing phase. For tri- 
connected graphs, one can make symmetric drawings (given a geometric auto- 
morphism group) subject to a variety of constraints (for example, straight line 
edges). However, extending these methods seems to be challenging. 



286 Seok-Hee Hong et al. 



References 

1. G. Di Battista and R. Tamassia, On-Line Maintenance of Triconnected Compo- 
nents with SPQR- Trees, Algorithmica 15, pp. 302-318, 1996. 283 

2. J.A. Bondy and U.S.R. Murty, Graph Theory with Applications, North Holland, 
Amsterdam, 1976. 281 

3. P. Eades and X. Lin, Spring Algorithms and Symmetry, Computing and Combi- 
natorics, Springer Lecture Notes in Computer Science 1276, (Ed. Jiang and Lee), 
pp. 202-211. 279 

4. M. Fontet, Linear Algorithms for Testing Isomorphism of Planar Graphs, Proceed- 
ings Third Colloquium on Automata, Languages, and Programming, pp. 411-423, 
1976. 278, 279 

5. S. Hong, Geometric Symmetry of Graphs and their Drawing Algorithms, KOSEF 
(Korea Science and Engineering Foundation) Report, 1998 (in Korean). 281 

6. S. Hong, P. Eades, A. Quigley and S. Lee, Drawing Algorithms for Series-Parallel 
Digraphs in Two and Three Dimensions, GD98 (Proceedings of the Sixth Sym- 
posium on Graph Drawing), Lecture Notes in Computer Science, Springer (to 
appear). 278, 281 

7. J. E. Hopcroft and R. E. Tarjan, A VlogV Algorithm for Isomorphism of Tricon- 
nected Planar Graphs, J. Comp. System Sci. 7, pp. 323-331, 1971. 278 

8. J. E. Hopcroft and R. E. Tarjan, Isomorphism of Planar Graphs, Complexity of 
Computer Computations, R. E. Miller and J. W. Thatcher, eds., Plenum Press, 
New York, pp. 131-151, 1972. 278, 284 

9. J. E. Hopcroft and R. E. Tarjan, Dividing a Graph into Triconnected Components, 
SIAM J. on Comput. 2, pp. 135-158, 1973. 283 

10. J. E. Hopcroft and J. K. Wong, Linear Time Algorithm for Isomorphism of Planar 
Graphs, Proceedings of the Sixth Annual ACM Symposium on Theory of Com- 
puting, pp. 172-184, 1974. 278, 279 

11. X. Lin, Analysis of Algorithms for Drawing Graphs, Ph.D. Thesis, University of 
Queensland, 1992. 279, 280 

12. A. Lubiw, Some NP-Complete Problems Similar to Graph Isomorphism, SIAM 
Journal on Computing 10(1):11-21, 1981. 278, 280 

13. J. Manning and M. J. Atallah, Fast Detection and Display of Symmetry in Trees, 
Congressus Numerantium 64, pp. 159-169, 1988. 277, 278, 279, 281 

14. J. Manning and M. J. Atallah, Fast Detection and Display of Symmetry in Out- 

erplanar Graphs, Discrete Applied Mathematics 39, pp. 13-35, 1992. 277, 278, 

279 

15. J. Manning, Geometric Symmetry in Graphs, Ph.D. Thesis, Purdue Univ., 1990. 
277, 278, 279, 280, 281, 285 

16. R. Mathon, A Note on Graph Isomorphism Counting Problem, Information Pro- 
cessing Letters 8, 1979, pp. 131-132. 278 

17. R. E. Tarjan, Depth-first Search and Linear Graph Algorithms, SIAM J. Comput. 
1, pp. 146-160, 1972. 282 

18. W. T. Tutte, Graph Theory, Encyclopedia of Mathematics and Its Applications, 
Vol. 21, Addison-Wesley, Reading, MA, 1984. 283 

19. H. Wielandt, Finite Permutation Groups, Academic Press, 1964. 278 

20. J. K. Wong, Isomorphism Problems Involving Planar Graphs, Ph.D. Thesis, Cornell 
Univ., 1975. 284 



New Approach for Speeding Up 
Enumeration Algorithms 



Takeaki Uno 

Dept. Industrial Engineering and Management, Tokyo Institute of Technology 
2-12-1 Oh-okayama, Meguro-ku, Tokyo 152, Japan 
uno@me . titech. ac.jp 



Abstract. We propose a new approach for speeding up enumeration 
algorithms. The approach does not relies on data structures deeply, but 
utilizes some analysis of its computation time. The enumeration algo- 
rithms for directed spanning trees, matroid bases, and some bipartite 
matching problems are speeded up by this approach. For a given graph 
G = (V,E), the time complexity of the algorithm for directed spanning 
tree is 0(log^ |E|) per a directed spanning tree. For a given matroid M, 
the algorithm for matroid bases runs in 0{T/n) time per a base. Here n 
denotes the rank of M , and T denotes the computation time to obtain el- 
ementary circuits. Enumeration algorithms for matching problems spend 
0(|y|) time per a matching. 



1 Introduction 

For many graph and geometry objects, enumeration algorithms have been de- 
veloped. Speeding up is one of important and interesting parts of the studies on 
enumeration algorithms. There are many algorithms and their improvements, 
especially for spanning trees and paths [1,2, 3, 4], although neither generalized 
technique nor framework is proposed for these improvements. Almost all fast 
enumeration algorithms are improved by speeding up their iterations with some 
data structures. Hence if we can not speed up iterations, we may obtain no fast 
algorithm. 

In this paper, we propose a new approach “trimming and balancing” for 
speeding up enumeration algorithms. Our approach is not based on data struc- 
tures, hence we can apply it to many enumeration algorithms which have not 
been improved in the existing studies. Our approach adds two new phases to 
an enumeration algorithm, which are called the trimming phase and the bal- 
ancing phase. By adding these, an iteration of a modified algorithm may take 
much computation time than original one, but the total time complexity of the 
modified algorithm is often considerably smaller than the original. Some our 
algorithms with the worst case time complexity of an iteration larger than the 
original one often attain a better upper bound of the time complexity than the 
original. The time complexity of a trimming and balancing algorithm is not so 
easy to analyze. We use a technique to analyze time complexities. 
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In the next section, we show the framework of the approach and the tech- 
nique of our analysis. We also show a trimming and balancing algorithm for the 
enumeration problem of directed spanning trees. 

2 Framework of Trimming and Balancing 

At the beginning of this section, we explain the idea of the trimming phase 
and balancing phase. To explain it, we use a binary partition algorithm for 
the enumeration problem of directed spanning trees whose root is a specified 
vertex r. A directed spanning tree ( denoted by DST ) is a spanning tree of a 
digraph satisfying that no its arc shares its head with the other. The algorithm 
inputs a digraph, and chooses an not “unnecessary” arc a* (which is called a 
partitioning arc ). An unnecessary arc is an arc included in all DSTs or no 
DST. The algorithm divides the problem into two subproblems of enumerating 
all DSTs including a*, and all those not including a*. These subproblems can 
be solved recursively with the graph obtained by removing all the arcs sharing 
their heads with a*, and that obtained by removing a* . If there is only one DST, 
then all the arcs are unnecessary. Hence the algorithm stops. The algorithm is 
known to take 0(|A|) time per an iteration, and per an outputted DST. Here 
we show the details of the algorithm. 

ALGORITHM: Enum_DST (G = {V,A) ) 

Step 1 : Find a partitioning arc a* . If there are only unnecessary arcs, 
then output the unique DST and stop. 

Step 2: Remove all arcs sharing their heads with a*, and call Enum_DST(G). 
Step 3 : Remove a* from G, and call Enum_DST (G) recursively. 

The trimming phase removes unnecessary parts from the input. In this al- 
gorithm, unnecessary arcs can be removed or contracted, since removals and 
contraction of these arcs effect no change to the original problem. These re- 
movals and contractions reduce the size of the input, thus the reduced input 
includes many outputs for its size. For example, as we show in the later section, 
a digraph G = (F, A) including no unnecessary arc ( “unnecessary” is charac- 
terized in the just above ) contains I?(|A|) DSTs. By the trimming phase, the 
computation time of an iteration will be not so large for the number of outputs, 
hence the total computation time per an output will be small. 

The trimming phase decreases the computation time but does not always 
decrease the computation time of the worst case. Suppose that the algorithm 
inputs a digraph shown in the Figure 1. As we can see, no arc is unnecessary. 
Now we suppose that the algorithm choose the arc a as a partitioning arc. Since 
only one DST includes a, one of the subproblem terminates immediately. To 
construct the other subproblem, a is removed. By the trimming algorithm, the 
arc sharing its head with a is contracted. The shape of the obtained graph is 
same as the original. If all the subproblems occurring in the algorithm choose 
the end-arc as a partitioning arc like the above, the total computation time will 
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Fig. 1. A digraph for enumeration problem of DSTs 



be 0(|A| + (|A| — 2) + (|A| — 4) + ... + 4). One of the subproblems of them outputs 
a DST, hence the computation time is 0(|A|) per a DST. 

Why the worst case running time does not decrease? The answer is that the 
choosing rule of partitioning arcs is bad. To reduce the time complexity, we have 
to make a good rule. The balancing phase is added for this reason. It chooses 
a partitioning arc such that both generated subproblems output not so few 
DSTs. The subproblems have not so many arcs for the number of outputs after 
the trimming phase, since we have a lower bound of the number of DSTs for its 
input size. Thus, if the input includes few DSTs like the above example, then the 
inputs of both subproblems will be small. Moreover, the depth of the recursion 
is small in this case, hence the total time complexity will be reduced. As we 
show in the later section, there is an arc such that both generated subproblems 
have at least |A|/4 arcs after the trimming phase. In the figure 1, the arc b is 
such an arc. The both generated subproblems by b have about \A\/2 arcs after 
the trimming phase. The shapes of the obtained graphs are also like the figured 
graph. Hence, by choosing partitioning arcs similarly, the depth of the recursion 
is 0(log |A|). The total time complexity is also reduced about 0(log |A|) per a 
DST, if those phases take only 0(|A|) time. 

In fact, the trimming and balancing algorithm for this problem described in 
the followings is not so simple like the above. We have to remove some more 
unnecessary parts from inputs. The trimming phase takes 0(|A|log|A|) time, 
hence the total computation time is 0(log^ |A|) per a DST. 

The trimming and balancing approach decreases the time complexities of 
enumeration algorithms, but it is not easy to estimate a good upper bound 
of those time complexities. The above example is simple one. There are some 
difficult algorithms to analyze their time complexities. In this paper, we also show 
a technique for analyzing the time complexities of enumeration algorithms. We 
explain it in the following. 

Before explaining our analysis, we introduce a virtual tree called an enu- 
meration tree of an enumeration algorithm, which expresses the structure of the 
recursive calls. For a given enumeration algorithm and its input, let V be a vertex 
set whose elements correspond to all recursive calls occurring in the algorithm. 
We consider an edge set f on V x V such that each whose edge connects two 
vertices if and only if a recursive call corresponding to one of the vertices occurs 
in the other. Since the structure of a recursive algorithm contains no circulation, 
the graph T = {V,£) forms a tree. This tree is called an enumeration tree of the 
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algorithm. The root vertex of the tree corresponds to the start of the algorithm. 
To analyze enumeration algorithms, we show some properties of enumeration 
trees which satisfy for any input. 

To analyze the time complexity of an enumeration algorithm, we consider 
the following distribution rule on the enumeration tree. Let T be an enumera- 
tion tree, and D{x) be the number of descendants of a vertex x oiT . Suppose 
that T(x) is an upper bound of the time complexity of an iteration x. We also 
suppose that T is an upper bound of maxa,g 7 -{r(x)/D(x)}. Our analysis uses 
a parameter T* . The distribution is done from a vertex to its children in the 
top-down manner. For a vertex x of the enumeration tree, let Tp{x) be the com- 
putation time distributed by the parent of x to x. We distribute Tp{x)+T{x) — T* 
of the computation time to its children such that each child receives the com- 
putation time proportional to the number of its descendants. We distribute the 
computation time of each child recursively. 

By this distribution rule, some vertices may receive much computation time. 
Thus we define excess vertices for a specified positive constant a > 1, and stop 
the distribution on the excess vertices. A vertex x is called excess if Tp{x) + 
T{x) > aTD{x). The children of an excess vertex receive no computation time 
from their parent. The distribution rule is also applied to the descendants of 
excess vertices. By this new rule, Tp{x) is bounded by aTD{x) for any vertex a;, 
since the computation time distributed from a parent to its child is proportional 
to the number of descendants of the child. 

After the distribution, no vertex except excess vertices has more than 
0(T -I- r*) on it. Next, we distribute the computation time on each excess ver- 
tex X to all its descendants uniformly. Since the excess time Tp{x) + T{x) — T* 
is bounded by [a + \)TD{x)^ each descendant receives at most (a -|- 1)T time 
from an excess ancestor. Let A* be an upper bound of the maximum number of 
the excess vertices on a path from the root to a leaf. By using X* we obtain an 
upper bound 0(T* -|- TX*) of the time complexity per an iteration. From these 
facts, we obtain the following theorem. 

Theorem 1. An enumeration algorithm terminates in 0(T* -\- TX*) time per 
an iteration. □ 

Our analysis requires T and X*. To obtain a good upper bound of the time 
complexity, we have to set X* and T to sufficiently good values. As a candidate 
of r, we can utilize m&yix^TT(x) / D(x) where D{x) is a lower bound of D(x). 
In the enumeration tree, it is hard to identify excess vertices, although we can 
obtain an efficient upper bound X* . Let x and y be excess vertices such that y 
is an ancestor of x and no other excess vertex is in the path Pyx from j/ to a; in 
the enumeration tree. Note that Pyx has at least one internal vertex. 

Lemma 1. At least one vertex w of Pyx \ y satisfies the condition that T(w) is 
larger than the sum of over all children u of w. 

Proof. If Pyx \ y includes no such vertex, then all vertices w of Pyx \ y satisfy the 
condition that Tp{w) < aTD{w). It contradicts to the assumption of the state- 
ment. We prove it by induction. Any child of y satisfies the condition since y is an 
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Fig. 2. The enumeration tree and computation time on each vertex: The vertices 
satisfying the condition of Lemma 1 are drawn by emphasized circles, and all 
leaves are drawn by rectangles. In this tree, we can set T to 7, and X* to 2. 



excess vertex. Suppose that a vertex w of Pyx\y holds Tp{w') < aT{w'), where w' 
is the parent of w. Then Tp{w) < (a + l)T{w')D{w) / D{w') . From the assump- 
tion, T{w') is not greater than the sum of -^^^TD(u) over all children u of w' . 
Since the sum of D{u) is not greater than D{w'), we have T{w') < -^^r^TD{w'). 

Therefore we have Tp(w) < /D(w') = aTD{w). □ 

From this lemma, we can obtain X* by estimating an upper bound of the 
number of vertices satisfying this condition in any path from the root to a leaf. 
Similarly, we can obtain the following corollary. 

Corollary 1 If T = ma,Xx^T{T{x)/D{x)}, a vertex w of Pyx \ y satisfies that 
D(w) is larger than the sum of over all children u of w. □ 

These conditions can be easily checked, and are often sufficient to analyze. 
In the following sections, we describe one of them, that for DSTs. To see the 
algorithms for perfect matchings, refer [-5]. 

3 Enumerating Directed Spanning Trees 

In this section, we consider an enumeration algorithm for DSTs. The simple bi- 
nary partition algorithm explained in the above section is proposed by 
H.N.Gabow and E.W. Myers [1] in 1978. Our algorithm is obtained by adding 
a trimming phase and a balancing phase to this algorithm. We describe these 
algorithms in the following sections. 

3.1 A Trimming Algorithm 

Our trimming algorithm removes unnecessary parts of input. Firstly, the algo- 
rithm removes all multiple arcs since at most one of multiple arcs can be included 
in a DST. Next we see characterizations for unnecessary arcs which are included 
in all DSTs or no DST. 
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Fig. 3. Back arc a is included in a DST since i{h{6, 6)) = 2. Back arc b is included 
in no DST since i{h{7,7)) = 3 and z(/i(8,8)) = 7. 



Property 1 For an arc (u,v), there is a DST including it if and only if there 
is a simple dipath from r to u not including v. □ 

Property 2 For an arc (u,v), there is a DST not including it if and only if 
there is a simple dipath from r to v not including (u,v). □ 

From Property 2, if all arcs not satisfying the former condition are removed, 
then an arc is not included in a DST if and only if it shares its head with the 
other arc. Let T be a depth-first search tree of G with the root r. For a non-back 
arc a of T, the dipath from r to the tail of a in T does not include the head 
of a. Thus the former condition holds for any non-back arc. In the following, we 
show a way for checking the condition for back arcs. 

Let us put indices i{v) to all vertices v in the order of visiting of the depth- 
first search. We denote the unique path in T from a vertex u to its descendant v 
by Puv For an index f, vertices v and it, we call a dipath from it to ?; an i-bypass 
if all its internal vertices have indices larger than or equal to i. For a vertex v 
and an index i < i(f), let h(v,i) be the minimum index vertex among vertices 
satisfying that there are some i-bypasses from the vertices to v. Since any dipath 
to V from a vertex with an index smaller than v includes a common ancestor of 
them, h{v, i) is an ancestor of v. Note that an ancestor has an index smaller than 
any its descendant. By using these notations, we state the following lemmas ( 
see Figure 3 ). 

Lemma 2. For a back arc (u,v), there is a dipath from r to u not including v 
if and only if i{h(w,i{w))) < i{v) holds for a vertex w in P^u \ v- 

Proof. The “if” part of the lemma is obviously. Suppose that there is a simple 
dipath P from r to m not including v. Then P includes some vertices of \ v. 
Let w be the first vertex of them to appear in the path. The subpath of P from r 
to w includes some ancestors of v. Let w' be the last vertex of them to appear 
in the subpath. Note that no vertex of Pw'w is an internal vertex of the subpath 
from w' to w of P. Moreover, the subpath includes no vertex u with i(u) < i(v) 
since any path from u to v includes some common ancestors of u and v, since T 
is a depth-first search tree. It contradicts the choosing way of u>'. □ 
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From this lemma, we can identify unnecessary arcs by using h. Our algorithm 
firstly obtains h(v^i(v)) for the vertex v with i{v) = n. In each iteration, we 
decrease i one by one, and obtain h(v, i) from h(v, i + 1) for all v with i(v) > i. 
The updating method of h is based on the following properties. 

Lemma 3. Suppose that a vertex u has the index i — 1. 

(1) h{u,i — 1) is the minimum index vertex among all v and h(v,i) satisfying 
that there are arcs {v,u). 

(2) If h(v,i) h(v,i — 1) holds for a vertex v, then v is a descendant of u and 
holds h(v,i — 1) = h(u,i — 1). 

Proof. (1) Let P be an i — 1-bypass from h{u, z — 1) to u. If P is not an arc from 
h(u,i — 1) to M, the vertex v next to u satisfies h{v,i) = h(u,i — 1). Thus the 
condition holds. (2) If h{v, z — 1) h{v, i), u is included in any i — 1-bypass from 
h[v,i — 1) to V. Hence h{v,i — 1) = h{u,i — 1). Since (a) a dipath from u to a 
vertex with an index larger than i—1 includes a common ancestor of them, and 
(b) any ancestor of u has an index smaller than z — 1, z; is a descendant of u. 
Therefore h{y,i — 1) y^ h{v,i) holds only for descendants of u. □ 

From the lemma, we can see that i{h(v,j)) < i(h(v^i)) for any j < i. 

Lemma 4. Let u be a vertex satisfying that i(u) > z, and v be a descendant 
of u. If we have hfa,i) = h(v,i), then h{u,j) = h{v,j) holds for any j < i. 

Proof Since u is an ancestor of z;, i{h{u,j)) > i{h{v,j)) holds for any j. Suppose 
that an index j satisfies i(h{u^j)) > i(h{v,j)). We assume that j is the maximum 
index among indices satisfying the condition. From this assumption, there is 
a j-bypass from h(v,j) to v not including u. Note that the bypass includes 
the vertex v' whose index is j. Since T is a depth-first search tree, any dipath 
from v' to V includes some common ancestors of v and v'. These ancestors are 
also ancestors of zz, thus we can obtain a j-bypass from h(v,j) to u by merging 
the j-bypass from h(v,j) to v and Py'u- It contradicts the assumption. □ 

From the lemma, if we have h(u, i) = h(v, i) for u and its child v, the equation 
holds for all j < i. Hence, in the graph obtained by contracting u and v, h is pre- 
served. Thus we contract them if such a vertex and a child exist. For a vertex zz, if 
a descendant z; of zz satisfies that i{h{y, 

i{u) + 1)) > z(/i(z;, z(zz))), the child v' of zz included in the dipath from u to v 
satisfies the condition i{h{v',i{u) + 1)) > i{h{u,i{u))). Hence v' and zz satisfy 
that i{h(v' , i{u))) = i(h(u,i(u))). Therefore, in each iteration with the index z, 
we find the child v' of the vertex zz with z(zz) = z which maximizes i(h(v' ,i)) 
among all children of zz. If v' satisfies i{h{v\i)) < i{h{u,i)), all descendants v 
of zz satisfies i{h{v,i)) < i{h{u,i)). Otherwise, we have that h{v\i) = h{u,i), 
hence we contract zz and v'. We do this operation until there is no child v 
satisfying i{h{v,i)) = i{h{u,i)). After contracting all these vertices, no descen- 
dant z; of zz satisfies that h{u,i) < h{v,i + 1). Therefore no vertex v satisfies 
h{v,i) y^ h{v,i + 1) in the contracted graph. Hence we can update all h by this 
contracting operation. Since using some binary trees, the total time complexity 
of the trimming algorithm is 0(|A| log |U|). 
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3.2 A Balancing Algorithm 

For an arc a, let G' be the graph obtained by deleting all arcs sharing their 
heads with a except for a. Under the condition that G is a trimmed graph, the 
following lemma holds. Suppose that a dipath from a vertex to an arc is a simple 
dipath from the vertex to the head of the arc which includes the arc. 

Lemma 5. Let P be a dipath from r to a. If an are e is included in all DSTs 
or no DST of G' , then the head of e is on P. 

Proof. We state the lemma by proving its contraposition. For an arc e whose 
head is not on P, there exists a dipath Q in G from r to e, because of Property 1. 
If Q includes no arc of G \ G', then it is also included in G'. Thus e is included 
in some DSTs of G' . Otherwise, we consider the subgraph given by Q U P n G' . 
The subgraph includes a simple dipath from r to e, since the head of e is not 
on P. Thus, only arcs which have their heads on P may be included in no DST 
of G'. 

Since G is trimmed, an arc e whose head is not on P shares its head with 
some other arcs. The arcs are also included in some DSTs of G'. Hence they are 
also not included in a DST of G'. Therefore any arc included in all DSTs of G' 
or no DST of G' has its head on P. □ 

Let a' be an arc sharing its head with a, and P' be a dipath from r to a' . 

Lemma 6. If an arc e is included in all DSTs or no DST of G\a, then its head 
is on P' . In the case that at least two arcs share their heads with a, for any arc e 
of the arcs, there are both DSTs including e and not including e. 

Proof. G\a includes the graph obtained by deleting all arcs sharing their heads 
with a' except for a'. Thus, for any arc whose head is not on P' , there are both 
DSTs including the arc and those not including the arc in G \ a from Lemma 5. 
Therefore the first assertion is proved. 

For any arc of G \ a sharing its head with a' , a dipath from r to it does 
not include a. Hence, from Property 1, the arc is included in some DSTs of G' . 
Except for the case that only two arcs a and a' have their heads on v, there are 
some DSTs not including the arc in G \ a. □ 

By using these lemmas, we select a partitioning arc as follows. Let the weight 
of a dipath be the number of arcs whose heads are on the path. The weight of 
a dipath P is denoted by w(P). In the balancing phase, we will find an arc a* 
satisfying the conditions that (a) the weight of a dipath from r to a* is at 
most 3|A|/4, and (b) for an arc a' sharing its head with a*, the weight of a 
dipath from r to the tail of a' is at most |A|/2. If an arc a* satisfies these 
conditions, G \ a* includes at most |A|/2 + 2 arcs which are included in all 
DSTs or no DST. Under the condition that |A| > 8, |A|/2 + 2 < 3|A|/4. The 
graph obtained by removing all arcs sharing whose heads with a* also includes 
at most 3|A|/4 such arcs. We consider the method for finding such an arc a*. 

Let T be a DST of G. To select an arc, we consider the following three 
cases. (1) If there is a non-back arc (u,v) of T such that w{Pru) < 1^1/2 
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Fig. 4. Vertices r, r', u, x and w. The paths are Pm and P' . 



and w{Prv) < 1^1/2, then the arc of T whose head is v satisfies the above 
conditions. (2) In the other case, let u be the vertex maximizing w(Pm) among 
all vertices. We denote the vertex next to r in Pm by r'. Let P' be a simple 
dipath from r to r' not including the arc (r, r'). P' always exists since G is 
a trimmed graph. Let w be the vertex minimizing w{Pm) among all vertices v 
of Pm satisfying that w{Pm) > |^|/2. We suppose that x denotes the first vertex 
to appear in P' among vertices v of Pm with w{Pm) < 1^1/2. Since any vertex 
except r is the head of at least two arcs, at least 2|V| — 4 arcs of G have their 
heads not on w. On the other hand, at most |V| — 1 arcs have their heads on w, 
thus the number of arcs whose heads are w does not exceed |^|/2. Therefore the 
weight of r' is at most |^|/2, and x always exists in P'. Let / be the arc of P' 
whose head is x. We show an example of these vertices and arcs in Figure 3. If 
the subpath P" of P' from r to f satisfies w{P") < |^|/2, then the arc of T 
sharing its head with / satisfies the conditions (a) and (b). 

(3) If / does not satisfies these conditions, w{P") > \A\/2. From the definition 
of x, the vertices included in both P" and Pmi are at most r and w. Hence P" 
includes w since more than |A|/2 arcs have their heads on Pmi- Suppose that Pi 
denotes the path with the smaller weight among Puw and the subpath of P" 
from r to w. We also denote the other path by P 2 . Let d denote the number of 
arcs whose heads are w. Since PinP 2 = {r, w}, we have w(Pi) < {\A\ — d)/2+d = 
|A|/2 + d/2 < 3|A|/4, and w{P 2 \w) < |^|/2. 

From the above observations, there is an arc satisfying the above two condi- 
tions in any trimmed digraph. To find such an arc, what we have to do is only 
to find a DST and some dipaths. They can be done in 0(|A| -|- |V|) time. 

By utilizing these algorithms, we obtain a trimming and balancing algorithm. 
The algorithm takes 0(|A| log |V|) time for an iteration in the worst case. In the 
following subsection, we bound the time complexity more tightly by using the 
distribution of computation time. 
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3.3 Bounding the Total Time Complexity 

To estimate an amortized time complexity of our algorithm, we firstly estimate 
a lower bound of the number of descendants of a vertex of the enumeration tree 
by the following lemma. Let Gx denote the graph which an iteration x inputs. 

Lemma 7. If any arc of G is included in a DST and not included in the other 
DST, G includes at least \A\/2 distinct DSTs. 

Proof. Let T be a DST of G. From Property 1, for any non-tree arc a of T, there 
is a simple dipath P from r to a. Hence, for any non-tree arc, we can construct 
its own DST T' by adding P to T and deleting some arcs. Since there are at 
least \A\/2 non-tree arcs in G, G includes at least |A|/2 DSTs. □ 

From the lemma, we obtain a lower bound £>{x) = \E{Gx)\/‘2^ of D{x). 
Here D{x) is the number of descendants of a vertex x of the enumeration tree, 
and E(Gx) denotes the arc set of Gx- The computation time on a vertex x is 
0(|if(Ga;)| log |P|) time, hence we set T = 0(log |P|), and T* = 0(log |P|). 

From Corollary 1, we can bound the number of excess vertices by the number 
of vertices satisfying that D{x) > -|- D{x 2 )) where x\ and X 2 are the 

children of x. If the condition holds, we have that |if(Ga;)|/2 > -^^^(\E{Gx^)\/2.+ 
\E{Gx,)\/2), thus \E{Gx,)\ < ^|L;(G,)| - |^;(G,)|/4 for e“ach child x,. By 
setting a = 8, \E{Gxi)\ — /^)\^{Gx)\. Therefore any path of the enumeration 

tree from the root to a leaf has at most logg /7 |H| excess vertices. We obtain 
an upper bound X* = logg^^ |H|. From the Theorem 1, we have the following 
theorem. 

Theorem 2. All DSTs in a digraph can be enumerated in 0{\A\ log \V\ + \V\ + 
Nlog^ |P|) time and 0(|H| -|- |P|) space where N is the number of DSTs. □ 
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Abstract. We show that recursive circulant G(cd"‘,d) is hamiltonian 
decomposable. Recursive circulant is a graph proposed for an intercon- 
nection structure of multicomputer networks in [8]. The result is not 
only a partial answer to the problem posed by Alspach that every con- 
nected Cayley graph over an abelian group is hamiltonian decomposable, 
but also an extension of Micheneau’s that recursive circulant G(2"*, 4) is 
hamiltonian decomposable. 



1 Introduction 

We say a graph G is hamiltonian decomposable if either the degree of G is 2k and 
the edges of G can be partitioned into k hamiltonian cycles, or the degree of G 
is 2A: -b 1 and the edges of G can be partitioned into k hamiltonian cycles and 
a 1-factor, where a 1-factor of a graph is a 1-regiilar spanning subgraph. If G is 
hamiltonian decomposable then G is loopless, connected, and regular. 

It is necessary for a graph G to have a hamiltonian decomposition that G has 
a hamiltonian cycle. However, the condition is by far not sufficient. Many authors 
have been dealing with sufficient conditions for the existence of a decomposition 
of a graph into hamiltonian cycles. But still, the current status of the matter 
lies, for the most part, in the sphere of problems and conjectures. 

A survey on hamiltonian decomposition of graphs is provided in [3,4]. The 
complete graph with odd (resp. even) number n of vertices is decomposable 
into hamiltonian cycles (resp. paths). The complete fc-partite graph 
K(ni, ri 2 , • • • ,nk) is hamiltonian decomposable if and only if rii = ri 2 = • • • = nk- 
The following problem on hamiltonian decomposability of Cayley graphs is posed 
by Alspach[lj. 

Problem 1 Does every connected Cayley graph over an abelian group have a 
hamiltonian decomposition ? 

* This work was partially supported by the Catholic University of Korea under project 
no. 19980057. 
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Much of the focus of research has been directed towards proving special cases 
of the problem. The answer is yes when the degree of the graph is five or less. 
The product of any number of cycles Cm x Cm x • • • x Cm is hamiltonian 
decomposable. The product of cycles is isomorphic to the Cayley graph of the 
corresponding cyclic groups with the standard generating set. The m-cube Qm, 
the Cayley graph of the product of m copies of a cyclic group Z 2 , is hamiltonian 
decomposable. 

Recursive circulant is a graph proposed for an interconnection structure of 
multicomputer networks in [8]. The recursive circulant C{N, d), d > 2, is defined 
as follows: the vertex set V = {vo,vi,V 2 , - ‘ ‘ and the edge set E = 

{{vi,Vj) I there exists k, 0 < k < [log^fV] — 1, such that i + d^ = j (mod N)}. 
Here each d^ is called a jump, and the size of the edge (vi,Vj) is C{N,d) 
also can be defined as a circulant graph with N vertices and jumps of powers 
of d, d^, - ■ ■ , d^^°^'‘ Examples of G{N, d) are shown in Fig. 1. 




Fig. 1. Examples of G{N, d) 



Recursive circulant is a Cayley graph over an abelian group, in more pre- 
cise words, the Cayley graph of the cyclic group Zn with the generating set 
{d°, d^, • • • , This paper is concerned with hamiltonian decompos- 

ability of recursive circulants. Micheneau shows that recursive circulant G{N, d) 
with N = 2^ and d = 4 is hamiltonian decomposable [7]. We shall prove that 
G{N,d) has a hamiltonian decomposition when N = cd™, 1 < c < d for some 
integer c and d. The result is an extension of Micheneau’s as well as a progress 
on Alspach’s problem. 

From now on, all arithmetics are done modulo cd’” using the appropriate 
residues. Graph theoretic terms not defined here can be found in [2]. This paper 
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is organized as follows. We give some preliminaries and related works in Sec- 
tion 2, and prove the main theorem in Section 3 that G{cd’^,d) is hamiltonian 
decomposable. Finally, concluding remarks are given in Section 4. 



2 Preliminaries and Related Works 

Let us consider the classes of graphs containing recursive circulants. Recursive 
circulant G{N, d) is a circulant graph. A circulant graph is defined as a Cayley 
graph over a cyclic group. Every Cayley graph over a general group is vertex sym- 
metric, and thus regular. These inclusion relationships are shown in Fig. 2 (a). 
Depending on restriction to N and d, the recursive circulants also have inclusion 
relationships shown in Fig. 2 (b). 



regular graph 



vertex symmetric graph 



Cayley graph 



Cayley graph over an abelian group 



circulant graph 


. 


recursive circulant 

) 



' G(N,d) 






\ 




G(ccT,d) 








G(2",2'‘) 


\ 




. 


G(2".4) 


G(2",2) 





(a) 



(b) 



Fig. 2. Graph classes 



Hamiltonian decomposition is one of the most interesting strong hamiltonic- 
ity, that is, a hamiltonian property which implies the existence of a hamiltonian 
cycle. Hamiltonian connectedness is also a strong hamiltonian property. A graph 
is hamiltonian connected if there is a hamiltonian path joining every pair of ver- 
tices in the graph. 

When one is trying to prove that every graph in a class has a certain hamil- 
tonian property and fails to do so, then typically two approaches are taken. One 
is to restrict the class of graphs and prove that the property holds over the re- 
stricted class, while the other is to restrict the property and prove that every 
graph in the class satisfies the restricted property. Many researches on hamil- 
tonicity of graphs in the literature take these approaches. Some well-known con- 
jectures and impressive properties on hamiltonicity of graphs in the classes that 
we have interest are shown below. 
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Conjecture 2 (a) Every connected vertex symmetric graph has a hamiltonian 
pathfLovaszJ. 

(b) Every connected Cayley graph with three or more vertices has a hamiltonian 
cycle[Chen] . 

(c) Every 2k-regular connected Cayley graph on a finite abelian group is hamil- 
tonian decomposable [Alspach] . 

Theorem 3 (a) Every connected Cayley graph over an abelian group is hamil- 
tonian connected[5]. 

(b) Recursive circulant G(2™,4) is hamiltonian decomposable[7j. 

Recursive circulant G{N,d) with three or more vertices has a hamiltonian 
cycle. Obviously, the set of edges of size 1 in G{N, d) form a hamiltonian cycle. 
Theorem 3 (a) shows that G{N, d) has a strong hamiltonicity of hamiltonian 
connectedness. 

Recursive circulant G{N, d) has a recursive structure when 
N = cd^,l < c < d. In other words, G{cd^,d) can be defined recursively 
by utilizing the following property. 

Property 4 Let Vi be a subset of vertices in G{cd"^, d) such that Vi = {vj \ j = 
i (mod d)}, m > 1. Eor 0 < i < d — 1, the subgraph of G{cd'^,d) induced by Vi 
is isomorphic to G{cd"'~^,d). 

G{cd™, d), m > 1, can be constructed recursively on d copies of G{cd"^~^ ,d) 
as follows. Let Gi{Vi,Ei), 0<i<d — 1, bea copy of G{cd^~^ ,d). We assume 
that Vi = {vq,v{, - ■ ■ and Gi is isomorphic to G{cd^~^ , d) with the 

isomorphism mapping n® to Vj. We relabel n® by Vjd+i- The vertex set V of 
G{cd^,d) is Uo<i<d-i Vij and the edge set E is (_Jo< 2 <c/— i ^ i where — 
{{vj,Vj') I j + 1 = f (mod cd"®)}. The construction of G(32,4) on four copies 
of G(8,4) is illustrated in Fig. 3. Every edge in X is of size 1, and X forms a 
hamiltonian cycle, called the fundamental hamiltonian cycle, in G{cdX,d). 

We denote by 5^ the degree of G{cd'^,d). The degree of a graph is the 
minimum degree over all vertices in the graph. 5m is greater than 5m-i by two, 
that is, 5m = 5m-i + 2, m > 1. in a closed-form is shown below. 

{ 2m — 1 if c = 1 and d = 2; 

2m if c = 1 and d 2] 

2m -k 1 if c = 2; 

2m -|- 2 if c > 2. 

3 Hamiltonian Decomposition of G{cd^, d) 

In this section, we prove the following main theorem by mathematical induction 
on degree of recursive circulant G(cd"®,d). We have two cases depending on the 
size of d. 

Theorem 5 Recursive circulant G{cdX,d) is hamiltonian decomposable. 
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G(8.4) 



G(8.4) 



G(8.4) 



G(8,4) 



Fig. 3. Recursive structure of G(32,4) 



3.1 Case of d > 4 

We show that G{c<F^ , d) has a hamiltonian decomposition such that hamiltonian 
cycles in the decomposition satisfy the following two conditions Cl.l and Cl. 2. 

Cl.l For every hamiltonian cycle Cj in the decomposition, there exists an in- 
dex kj such that Cj passes through two adjacent edges (vkj,Vkj+i) and 
(vkj+i,Vkj+ 2 ) of size 1. 

Cl. 2 For any pair Cj and Cj> of hamiltonian cycles in the decomposition, 
{vkj,Vkj+l,Vkj+2}r]{vk.,,Vk.,+l,Vk.,+2} = 0 - 

Let us first consider a hamiltonian decomposition of C{cd"^,d) with small 
degree. C{cd"^,d) with degree two is the fundamental hamiltonian cycle itself. 
When the degree of C{cd^,d) is three, G{cd^,d) consists of the fundamental 
hamiltonian cycle and a 1-factor. In both cases, the natural decomposition leads 
to a hamiltonian decomposition satisfying the above two conditions. 

Now we try to find a hamiltonian decomposition of G{cd"^,d) by using the 
fact that G{cd'^~^, d) has a hamiltonian decomposition satisfying the two condi- 
tions. The degree of G{cdJ^ , d) is greater than that of G{cd^~^ ,d) by two. Note 
that G{cd'^,d) has a recursive structure, that is, G{cd"^,d) is constructed on d 
copies of G{cd"^~^ ,d). We let Gi denote a copy of G{cd"^~^ ,d), and assume that 
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the vertices of Gi is {vq,v\, ■ ■ ■ The vertex Vj is relabeled by vjd+i- 

Hereafter, two labels for the vertex are used interchangeably. 

We denote by X the fundamental hamiltonian cycle in 0(0(1"^, d). Let C* 
denote a hamiltonian cycle in the hamiltonian decomposition of Gi. Cj passes 
through the edges and some kj by condition Cl.l. 

The basic idea of the proof is that we merge d different cycles Cj, 0 < i < d, 
into one hamiltonian cycle Cj in G{cd^,d) by exchanging some edges of C*’s 
with edges in X while we keep X being a hamiltonian cycle. We merge another d 
different cycles C*, into another hamiltonian cycle Cji by these edge exchange 
operations. We repeat this until a hamiltonian decomposition of G(cd"‘,d) is 
obtained. 

We represent Gj and X as a sequence of vertices. It is convenient to rep- 
resent explicitly only the vertices concerned with edge exchange operations as 
follows. Here P* is a path from to (excluding the start and end ver- 
tices) passing through all the vertices in Gi except vl_, ^ 

is a path from in G{cdX,d) passing through all the other ver- 

tices not represented explicitly. In other words, P passes through the vertices 
{u* I 0 < i < d, 0 < j < cdX~^,j yf kj,kj + 1, kj + 2} in some order. 












..d-1 



..d-1 



’ '^kj ’ '^ki + l’'^ki + l’ ■ ■ ■ ; 
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d-1 

kj+2^ 



P 



The edge exchange operation depends on the parity of d. 

Case 1 even d 

d hamiltonian cycles Gj ’s are merged into a hamiltonian cycle Cj in G{cd"^, d) 
as follows(See Fig. 4 (a)). Note that X still remains a hamiltonian cycle in 
G(cd'",d). 



Ci=vl,vl,P\vl 



■ ■ ■ 7;'^“^ pd-4 ..d-i „,d-3 pd-3 
fej +2; I ‘^fc, +2> ’^fej-|-2> 



,,d-3 



„d-4 



pd—2 „,d—2 d—2 

^ T^ki Xki+n ^ki+1^ ^ki+n' 



,,i ,,o ,d-i p' 

^ ‘^ki + lXki + lXki 



d-1 



,.d—l „.d— 1 ,,0 pO 



'\j+2^'^kj + lT'^kj+2^ ^ 






'^kj + lXkj+2^^kj+2^'^kj + l^'^kp ' 
,,d-3 



..d-4 rf-4 









..d-l 



^ d-3 d-2 d-1 d-1 d-2 d-2 a-i p 

%• + !’% ’ % + l’ % + l’ %-+2’ %-+2’ 



V 



d-2 

fci+2’ 



d-3 
kj -\-2 ^ 



The edge exchange operations can be performed independently since the 
hamiltonian decomposition of Gi always satisfies the condition Cl. 2. Thus every 
hamiltonian cycle in Gi can be merged into a hamiltonian cycle in GicdP^d). 
If Gi has a 1-factor in the hamiltonian decomposition, Gi has odd degree and 
even number of vertices. The union of 1-factors in Gds forms a 1-factor of 
G(cd"*,d). We have successfully constructed the hamiltonian decomposition of 
G(cd"*,d) in this case. 

Now let us prove that the hamiltonian decomposition presented satisfies the 
conditions Cl.l and Cl. 2. Cj passes through the edges 
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and Eventually in G{cd"^,d), they are adjacent edges of size 1. 

X passes through the edges and ^ satisfies the 

condition Cl.l. The three vertices associated with Cj are different from the three 
vertices associated with X . From the fact that any pair C] and C*, of hamil- 
tonian cycles in the decomposition of G® satisfies the condition Cl. 2, the three 
vertices associated with Cj are disjoint from the three ver- 
tices associated with Cj/. Thus we have a hamiltonian 

decomposition of G{cdX,d) satisfying Cl.l and Cl. 2. 

Case 2 odd d 

The proof of this case is similar to that of Case 1. Cj and X after the edge 
exchange operation are shown below(See Fig. 4 (b)). 
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Cj and X are hamiltonian cycles in G(cd™,c?). Cj passes through edges 
(vkj+iXlj+i) and and X passes through 

and ('Cfe.+ 2 j 'I'fe + 2 )- It is easy to check that the hamiltonian decomposition pre- 
sented satisfies the conditions Cl.l and Cl. 2, and thus omitted here. 



3.2 Case of d = 2, 3 

We show that G(cd"‘,d) has a hamiltonian decomposition which satisfies the 
following two conditions C2.1 and C2.2. 



C2.1 For every hamiltonian cycle Cj in the decomposition, there exists an in- 
dex kj such that Cj passes through an edge {vk^ ,Vkj+i) of size 1. 

C2.2 For any pair Cj and Cj' of hamiltonian cycles in the decomposition, 
{vkj,Vkj+i} n {vk.,,Vk.,+i} = 0. 

We observe that G{cd"^,d) with degree two or three has a hamiltonian de- 
composition satisfying conditions Cl.l and Cl. 2, and that the decomposition 
also satisfies the above two conditions C2.1 and C2.2. We have two cases. 

Case 1 d = 3 

Cj and X are shown in the following. Here P* is a path from Vk.+i to v\. 
passing through all the vertices in G* excluding the start and end vertices. P is 
a path from to passing through all the vertices {ti* | 0 < i < d, 0 < j < 

cdP~^,j yf kj,kj + 1} in G{cdX',d). 



X = vl. 



1 2 






304 



Jung-Heum Park 





Fig. 4. Case of d > 4 



Cj and X after the edge exchange operation are as follows(See Fig. 5 (a)). 
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Cj passes through edge (vl.jvl.) of size 1, and X passes through edge 
The vertices {Vkj’’^kj} associated with Cj are disjoint from the 
vertices associated with any other hamiltonian cycle Cji. Thus 

the hamiltonian decomposition satisfies the conditions C2.1 and C2.2. 

Case 2 d = 2 

Cj and X are shown in the following. P* and P can be defined in a similar 
way to Case 1. 



C} = vL,vl.„P^ 



X = v° 



‘"fc,+n 



,V 
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Cj and X after the edge exchange operation are as follows(See Fig. 5 (b)). 

Cj = vl. , 4 . , pi , .+ 1 , .+ 1 , P° 

X = vl vl vl ^^,P 
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Cj passes through edge {v'l. , vl_) of size 1. But all edges in X associated with 
the edge exchange operation are not of size 1 . If we can choose a vertex in Gq 
such that v'^ ^ over all cycles C° in the hamiltonian decomposition 

of Gq, then the edge must be in X and of size 1. The existence of such a 

vertex can be shown by a counting argument. Gq is a copy of G{cdX‘~^,d) with 
d = 2, that is, G(2"*“^,2). G(2™“^,2) has degree 2(m — 1) — 1 = 2m — 3 and 
[(2m — 3)/2j = m — 2 hamiltonian cycles in the decomposition. Two vertices 
and are associated with each hamiltonian cycle G° and 2(m — 2) vertices 

in total. The number 2(m — 2) is smaller than the number 2"*“^ of vertices in 
G(2™-i^2). Thus, we can always choose such a vertex in Gq. Obviously, the 
hamiltonian decomposition satisfies the condition C2.2. 




(a) d = 3 



(b) d = 2 



Fig. 5. Case of d = 2, 3 



4 Concluding Remarks 

We have shown that recursive circulant G(cd™,d) is hamiltonian decomposable. 
A hamiltonian decomposition can be constructed recursively by following the 
proof given in this paper. Associated with the Alspach’s problem, we give con- 
jectures on hamiltonian decomposability of recursive circulants and circulant 
graphs. 

Conjecture 6 (a) Recursive circulant G(N,d) is hamiltonian decomposable. 

(b) Every connected circulant graph is hamiltonian decomposable. 

A directed version of the hamiltonian decomposition problem, called dihamil- 
tonian decomposition, is decomposing a graph G into directed hamiltonian cycles, 
when we regard an edge (v, w) of G as two directed edges (v, w) and {w, v) of 
opposite direction. The problem has an application in the design of reliable algo- 
rithms for communication problems such as broadcasting and multicasting under 
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the wormhole routing model [6]. We pose open problems on dihamiltonian decom- 
posability of recursive circulants and m-cubes. Both recursive circulant G(8,4) 
and 3-cube Qa not dihamiltonian decomposable due to the fact that every 3- 
regular graph with a multiple of 4 vertices is not dihamiltonian decomposable [9]. 

Problem 7 (a) Does recursive circulant G(2™,4) with m > 4 have a dihamil- 
tonian decomposition? 

(b) Does m-cube Qm with m> 4 have a dihamiltonian decomposition? 
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1 Introduction 

Consider a square grid graph G„ on n? grid points {1, . . . , n} x {1, . . . , n} in 
the plane, where every unit-distance pair of points are connected by an edge 
segment. A grid filling curve, or GFC for short, is a hamilton path of G„ between 
two corner vertices (1, 1) and {n,n). Since a parity argument shows that such a 
hamilton path exists only if n is odd, we will assume that n is odd throughout the 
paper. It is easy to construct a GFC under this assumption: a standard snake- 
order traversal suffices. We are, however, interested in generating random GFCs 
(see below for motivations). To this end, we define a simple local operation called 
a flip that converts a GFC into another and apply a sequence of random flips to 
the standard GFC hoping that the result will be a random GFC in some useful 
sense. The first thing we would want to establish in such a random generation 
method is that every GFC has a positive probability of being generated. We 
show in this paper that it is indeed the case: our result states that given any two 
GFCs, there is a sequence of flips that converts one into the other. 

Grid filling curves are not only interesting by themselves but also applied to 
various problems, such as efficient disk seek methods [3] and many others. One 
typical application among them is found in digital halftoning which is a technique 
to convert an image with several bits for brightness levels into a binary image 
consisting of black and white dots. One well-known method is the one called 
’’error diffusion” [5] which propagates quantization errors to neighboring pixels 
following the order of raster scan. Unfortunately, the regularity of its scanning 
order produces regular patterns known as moire in large uniform regions. To 
get rid of such regular patterns for better image quality we need to incorporate 
some randomness, which could be introduced by random grid filling curves. In 
fact, several methods based on this idea are proposed that use space filling 
curves such as Peano curve [4], Hilbert curve [7], and some others[l,2,8]. In [3] a 
general scheme for defining a class of space filling curves based on a grammar is 
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presented. Experimental results suggest that those conventional discrete space 
filling curves or those defined by recursive grammars still retain regularities. 
Thus, we need really random-looking grid filling or space filling curves. Although 
the grammar-based approach provides some freedom in curve generation which 
can be exploited to introduce randomness, it suffers from the inherent constraint 
that (when the space is discretized into a grid) the side length of the grid must 
be a power of some constant. Our random generation method is free from this 
constraint since it is applicable to any n x m grid where either n or m is odd. 

In their recent work on the hamiltonicity of solid grid graphs (which are a 
generalization of the grid graphs we are dealing with), Umans and Lenhart[6] 
adopts the following approach: start with a spanning set of cycles and apply 
stages of local modifications, each stage reducing the number of cycles by one, 
until a hamilton cycle is reached or no further reduction in the number of cycles 
is possible. Their basic operation is also called a flip (with a different meaning) 
which either merges two cycles into one or splits one cycle into two. Let us call 
this type of flips as Umans-Lenhart flips or UL-flips for short. If we regard a GFC 
as a hamilton cycle with a help of an additional edge between the two corner 
vertices, our flip on a GFC consists of two UL flips, one of which splits the GFC 
into two cycles while the other merges those two back into a GFC. Thus, we 
quite naturally encounter similar difficulties as encountered by those authors. 
Nonetheless, the two problems are quite independent, different techniques are 
used, and neither result implies the other. It appears that their result may imply 
the convertibility among GFCs via UL-flips (although the proof is not entirely 
clear). Even if this is the case, UL-flips are not suitable for our purposes of 
random generation, because a sequence of UL-flips, as opposed to a sequence of 
our flips, may split a GFC into a collection of an arbitrary number of cycles. In 
our random generation context, this means that we would do a random walk in 
a space in which the set of GFCs forms only a tiny subspace. Our flip combines 
two UL-flips so that we always stay within this subspace. Thus, convertibility 
via our flips is not only logically stronger than convertibility via UL-flips but 
also very essential in our applications. 

We believe that our result can be extended to the case where the entry and 
exit points of a GFC are arbitrarily given as long as the parity condition is met. 
Such a generalization would be meaningful when we try to integrate our method 
into a recursive approach where a GFC of a subgrid is used as a component of a 
GFC of the entire grid. We have not, however, worked out technical details yet. 

2 Notation and Definitions 

Our grid graph G„, as defined in the introduction, is a plane graph with (n— l)x 
(n— 1) square faces. These faces will be referred to as cells. The cell with vertices 
(i,j), {i + 1, j), {i,j + 1), {i -I- 1, j -I- 1) is identified with the index (i, j) so that 
the set of cells is {1, . . . ,n — 1} x {1, . . . , n — 1}. For convenience we add cells 
around the original plane graph, which results in an extended set of cells Cn = 
{0, 1, . . . , n} X {0, 1, . . . , n}. We call a cell {i, j) a boundary cell if either i S {0, n} 
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or j S {0,n}; otherwise it is an internal cell. The cell graph is a graph on the 
vertex set Cn in which two cells (i, j) and (i' ,j') are adjacent if they are adjacent 
in the usual sense, i.e., \i — i'\ + \j — j'\ = 1. 

Each GFC may be associated with a coloring of cells in the following manner. 
A cell-coloring A : C„ — *■ {black, white} assigns one of the two colors to each 
cell, with the following fixed coloring of the boundary cells: each cell (0,j) in 
the 0th row and each cell (i,n) in the nth column except for cell (n, n) are 
colored black and each cell (n,j) in the nth row and each cell (i,0) in the 0th 
column except for cell (0, 0) are colored white. A cross in a cell-coloring is a set 
of four cells ci = (f, j), C2 = {i,j 1), cs = {i 1, j), C 4 = {i 1, j -I- 1) forming 
a square such that ci and C4 are of the same color and C2 and C3 are of the other 
color (Figure 1). In a cell-coloring A, a cycle of the cell graph is called a black 
{white, resp.) cycle of A if A colors every cell in the cycle black (white, resp.); it 
is called a monochromatic cycle of A if it is either a black or white cycle of A. 
A well-formed cell-coloring, or WFC for short, is a cell-coloring that does not 
have either a cross or a monochromatic cycle. Observe that the subgraph of the 
cell graph induced by all the black (white) cells of a WFC is connected (because 
otherwise there would be a cycle of the opposite color) and hence is a tree. 



o# 


• 0 


•0 


09 


(a) 


(b) 



Fig. 1. Forbidden subarrays: crosses. 



Proposition 1. GFCs and WFCs are in one-to-one correspondence in the fol- 
lowing manner. 

(1) Let P be a GFC. As we follow P from the entry point (1,1) to the exit 
(n,n), color the internal cells to the left of P black and those to the right of P 
white. Then, together with the fixed coloring of the boundary cells, the resulting 
cell-coloring is a WFC. 

(2) Let A be a WFC. Then, the set of edges of the grid graph Gr which border 
two differently colored cells form a GFC . 

See Figure 2. Following a convention in computer vision, the (0, 0) cell is 
placed at the upper-left corner in our illustrations. In the sequel, we will work 
on WFCs rather than directly on CFCs, because WFCs visualize our definitions 
and facilitate our reasoning. 

3 Convertibility by Successive Flips 

Our local operation on CFCs is best described as a flip on WFCs. A flip is an 
unordered pair {c, d} of distinct internal cells c,d€ Cn. We say that flip {c, d} is 
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Fig. 2. A well-formed coloring and the corresponding grid filling curve. 



applicable to a WFC A, if A{c) ^ A{d) and swapping the values of A{c) and A{d) 
in A results in another WFC. The goal of this section is to show that, given any 
two WFCs A and B, there is a sequence of flips which, when applied successively 
to A, produces B. To this end, we define the canonical WFC, which we denote 
by A*, as follows: for each internal cell (i,j), A*{i,j) is black if and only if i is 
even (see Figure 3). 
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Fig. 3. Canonical WFC. 



Theorem 1. Given any WFC A, there is a sequence of at most flips that 
transforms A into the canonical form A* . 

The rest of this section is devoted to the proof of the above theorem. We need 
some more definitions. Totally order the cells of Cn lexicographically: (i,j) < 
(i', f) ^ i < i' or {i = i' and j < f). For a WFC A, we say that a cell c is fixed 
in A if either c is a boundary cell or an internal cell satisfying A{d) = A*(c') 
for every c' < c; otherwise cell c is floating. We call a flip {c, d} valid for A 
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if it is applicable to A and both c and d are floating in A. Our strategy is to 
fix the cells one by one in the lexicographically increasing order of cells until 
all the cells are fixed. Thus, if c is the lexicographically smallest floating cell 
in the current cell-coloring A, we seek a sequence of valid flips that leads to a 
cell-coloring in which c is fixed. This is unfortunately not a simple task as it 
might seem. See Figure 4 for example. In this figure, the black cell c = (1,7) is 
the smallest floating cell. There is no d such that {c, c?} is a valid flip. We may 
construct a sequence of flips, starting from the flip of two horizontally adjacent 
cells in the center, that eventually inverts c. Note that every flip immediately 
applicable to this example inverts at least one cell that already has the right 
color in the canonical coloring (so that any single flip does not make an obvious 
progress towards the canonical coloring). 




Fig. 4. An instance of WFC in which fixing the lexicographically smallest next 
cell requires more than one flips. 



This example suggests that a cell of a particular type is of interest, which we 
call a spike. A spike of cell-coloring A is a cell c = (i,j) such that all the three 
cells {i — l,j — 1), {i — 1, j), {i — l,j + 1) have the same color as c in A, unless 
they belong to the leftmost or the rightmost column (we allow a cell among the 
three to be of the opposite color if it belongs to the 0th or the nth column) . Note 
that if c = (i, j) is a spike then the two cells (i, j — 1), (z, j -I- 1) next to c are of 
the opposite color to c. See Figure 5(a). We say that a spike c = (z,j) of A is 
robust if all the three cells {i — l,j — 1), (z — 1, j), (z — l,j + 1) are of the same 
color as c (even if they are boundary cells) and, in addition, cell (z — 1, j — 2) or 
(z — 1, j-l-2) exists and has the opposite color of c. Note that A(z — 1, j — 2) ^ A{c) 
(A(z — 1, j-l-2) ^ A(c), resp.) implies A{i,j — 2) ^ A{c) (A{i,j + 2) ^ A(c), resp.) 
since otherwise A would contain a cross. See Figure 5(b). Finally, we call a spike 
c = (z, j) short if cell (z -|- 1, j) below c has the opposite color to c. Otherwise call 
it long. 
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Fig. 5. (a) spike and (b) robust spikes. 



Proposition 2. Every spike of a cell-coloring A is floating in A. Moreover, the 
lexicographically smallest floating cell of A is a spike of A. 

Let c be a spike of A. We are interested in finding a valid flip for A of the form 
{c,d}. To seek such a cell d, invert the color of c alone in A and let A denote the 
resulting cell-coloring. Then A is still cross-free (since c now assumes the same 
color as the two cells on its sides) but contains a cycle of the color opposite to 
the original color of c. If c is a short spike then the cycle introduced is the 6-cycle 
formed by the six cells (*',/), i < *' < * + 1, j — 1 < / < j + 1 (Figure 6(a)). 
If c is a long spike then the cycle introduced is unique and encloses at least one 
cell of color A{c) in its inside (Figure 6(b)). In either case, we will refer to this 
introduced cycle as the cycle associated with spike c. 
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Fig. 6. (a)6-cycle associated with a short spike and (b)cycle associated with a 
long spike. 



Let c be a spike of WFC A and let Q be its associated cycle. If d is a 
cell such that {c, d} is a valid flip for A, then d must belong to Q because 
otherwise the monochromatic cycle Q would survive in the well-formed cell- 
coloring that results from applying {c, d} to A. We call such d an opener of Q, 
because inverting d opens up the cycle Q. We want to find an opener or, when 
necessary, apply preliminary sequence of flips to produce an opener of Q. 

Proposition 3. Let c he a long spike of a WFC A and let Q be its associated 
cycle. Let d ^ c be a cell on Q and let d\ and d 2 he the two cells adjacent to d 
but not on Q. Then, d is an opener of Q if and only if 
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(52) d\, d, and c ?2 He on the same row or column, and 

(53) di and c ?2 have the color opposite to that of d. 
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We call a cell d c on Q a, potential opener of Q if it satisfies conditions (SI) 
and (S2) but not necessarily (S3). We are particularly interested in potential 
openers for which 

(1) the cells di, d, and ^2 in the condition (S2) above are in the same column, 
i.e., d = (i,j), di = {i- 1, j) and d 2 = {i + 1, j), and 

(2) the color of c ?2 is the same as the color of d. 

Clearly, d 2 is a spike when these conditions are satisfied: we say that potential 
opener d has spike d 2 in this case. 

For cell c = (i, j), let is(c) = i + j denote the index sum of the cell. 

Proposition 4. Let c he a long spike of a WFC A and let Q be its associ- 
ated cycle. Let s = max{is(d) \ d is an internal cell belonging to Q} and suppose 
is(c) < s — 2. Then, Q has a potential opener d with \s{d) > s — 1 which either 
is indeed an opener or has a robust spike. 

Proof. Let ci = (i,j) denote the internal cell of Q with is(ci) = s such that its 
row number is the largest possible under this condition. First suppose that ci is 
in the (n — l)st row, i.e., z = n — 1. In this case, clearly ci and its left neighbor 
(n — l,j — 1) are greater than c in the lexicographic order of the cells and thus 
are floating. If the color of Q is white then the cycle contains the boundary cells 
in the bottom row and hence the cells (n — I, j ± 1) on both sides of ci must be 
black in order to avoid a white 4-cycle (Figure 7(a)). Therefore, ci is indeed an 
opener. If the color of the cycle Q is black, then cell (n — 2,j) above ci and cell 
(n — l,j — 1) on the left of ci belong to Q together with ci (because otherwise Q 
must contain a cell with index sum strictly greater than that of ci contradicting 
the choice of ci) and hence are black. This implies that cell {n — 2,j — 1) on the 
upper left of ci must be white in order to avoid a black 4-cycle (Figure 7(b)). 
Therefore the left neighbor (n — I, j — 1) of ci is an opener. Now suppose that 
i < n — 1. Then, cell (z -I- l,j) or (z, j -|- 1) is not contained in Q because, by 
assumption, ci is the internal cell with the largest index sum in Q. Moreover, 
cell (z -I- 1, j — I) is not contained in Q because Ci is assumed to have the largest 
row number among the internal cells with the largest index sum in Q. Therefore, 
the situation should look (assuming without loss of generality that Q is black) 
as shown in Figure 7(c). 

If the color of cell (z — 1, j -I- 1), which is marked by • in the figure, is white 
then cell {i — l,j) is an opener satisfying the condition. So, suppose the color of 
cell (z — 1, j -I- I) is black. Then the color of cell (z, j -I- I), marked by o, must be 
white in order to avoid a black 4-cycle. Now, cell (z, j — 1) is a potential opener 
and if it is not indeed an opener then cell (z -I- I, j — 1), marked by o, must be a 
spike. Because the color of cell (z, j -I- 1) is white, it is a robust spike. 



We are now ready to state and prove our main lemma for the theorem. 
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Fig. 7. Situations in the proof of Proposition 4. Cell ci is white and lightly 
shaded in (a), is black and painted dark in (b) and in (c). 

Lemma 1. Let A be a non-canonical WFC and let c he a spike of A which 
is either the lexicographically smallest floating cell of A or a robust spike of A. 
Then, there is a sequence of flips {ci, di}, {c 2 , ^ 2 }; ■ ■ ■ , {cm, dm} and a sequence 
of WFCs A = Aq, Al, A 2 , ■ ■ ■ Am such that 

1. Ak is the result of applying flip {ck,dk} to Ak-i, for 1 < k < m, 

2. Ck and dk are floating in A, for 1 < k < m, 

3. is(dfc) > is(cfc) for \ < k < m and is(cfc) > is(dfc+i) for 1 < k < m, and 

4- O-m — C. 

Remark Note that this sequence inverts the color of c so that if c is the smallest 
floating cell then it is fixed as a result. 

Proof. The proof is by induction on 2n — is(c). The base case trivially holds 
because if is(c) > 2n — 1 then c cannot be a spike. Let c = (i,j) be a spike with 
is(c) < 2n — 2 that is either the smallest floating cell or an arbitrary robust spike. 
Let Q be its associated cycle. 

(Case 1: spike c is long.) We claim that, the cycle Q associated with spike c 
contains a cell whose index sum is at least is(c) + 2. To see this, first note that Q 
separates cell {i + 1, j) below c from cell {i — l,j) above c. If cell {i + 1, j) is 
enclosed inside Q, then the claim is obvious. So, suppose {i — 1, j) is inside Q. 
This is impossible if c is the lexicographically smallest floating cell, because then 
inverting c will make the coloring down to row i identical to that of the canonical 
coloring and leaves no monochromatic cycle in that part. Therefore, c is a robust 
spike in this case. Therefore, cell {i — 1, j + 1) is of the same color as {i — l,j) 
and is also enclosed inside Q; Q must contain a cell with index sum is(c) + 2. 

If Q contains an internal cell with index sum at least is(c) + 2 then, by 
Propositions 4, Q has a potential opener d with is(d) > is(c) + 1 such that d is 
indeed an opener or has a robust spike di. If d is indeed and opener, a single 
flip {c, dj proves the lemma, while if di is a robust spike then we may apply 
the induction hypothesis to the pair {A,di) and append the flip {c, dj to the 
resulting sequence to obtain the sequence for the original pair {A, c). 

It remains to examine the special case where the only cell in Q with index 
sum at least is(c)+2 is a boundary cell. This happens only when j = n—2 and the 
color of c is white, so the situation looks as in Figure 8(a). We let d = {i,n— 1), 
the cell to the right of c, and a single flip {c, dj does the task. 

(Case 2: spike c is short.) In this case, the cycle Q associated with spike c is 
a 6-cycle formed by cells {i',f), * < *' < * + 1, j — 1 < j' < j + 1- We proceed 
according to the following subcases. 
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(Case 2.1) c = (i,j) is a robust spike. Then, we cannot have i = n — 1. To see 
this, suppose i = n—1. Since Q then contains boundary cells in the nth row that 
are white, the color of the spike c is black and, since c is a robust spike, we must 
have a situation as in Figure 8(b) which contains a white cycle, a contradiction. 
Thus, cell c? = (i + 1, j) is an internal cell. If di = (i + 2, j) is not a spike then flip 
{c,d} is applicable to A so we are done (see Figure 8(c)). If c?i is a spike, then 
we argue that it must be robust. Assume without loss of generality that cell c is 
black. Since c is a spike, cell {i,j — 1) on the left of c is white. Moreover, from the 
assumption that c is a robust spike, either cell {i — l,j — 2) or cell {i — 1, j + 2) 
is white; assume that cell {i — l,j — 2) is white without loss of generality. This 
implies that cell {i,j — 2) is also white as noted before. Since di = {i + 2, j) is 
a white spike, the internal cell (i + l,j — 1) must also be white. Summing up, 
the three cells {i,j — 2), {i,j — 1) and {i + l,j — 1) are all white, so that cell 
(i + 1, j — 2) must be black in order to avoid a white 4-cycle, making d\ & robust 
spike (Figure 8(d)). Therefore, we may apply the induction hypothesis to the 
pair (A,c?i) and append flip {c, d} to the resulting sequence. 

(Case 2.2) c = (i, j) is the smallest floating cell of A. We first argue that i < n—2. 
To see this, suppose that i = n — 1. Then, because Q includes cells in the nth 
row which are white, the color of c must be black. Since this is the right color 
for the (n — l)st row in the canonical coloring, this is a contradiction to the 
assumption that c is the smallest floating cell of A. Thus, cell d = {i + l,j) is 
an internal cell. If cell di = {i + 2, j) is not a spike, then flip {c, d} is applicable 
to A so we are done. When di is a spike, we only need to establish that it is a 
robust spike, because then the induction would work. 

To verify that di is robust, first suppose that j > 3. Assume without loss of 
generality that cell c is black. Then, the cells (i, j — 2) and (i,j — 1) (the two 
cells to the left of c) are white, because c is the smallest floating cell of A. Cell 
{i -I- 1, j — 1) belongs to the 6-cycle Q associated with the spike c and therefore 
are also white. Therefore, cell {i + 1, j — 2) must be black in order to avoid a 
white 4-cycle, making di a robust spike. See Figure 8(e). Next suppose j = 2. 
Then cell c must be white. To see this, suppose c is black. Then cell (i, 1) to the 
left of c is white and, since spike c is short, cell {i + 1, 2) below c is also white. 
Therefore, to avoid a cross, cell {i + 1,1) must be white and a white 4-cycle is 
formed by cells (f, 0), (i, 1), {i -1-1,0), and {i + 1, 1), a contradiction. So suppose 
the color of c is white. Then, the color of the spike d\ is black, and therefore 
it is indeed a robust spike because the boundary cell (z -I- 1, 0) is white. Finally 
suppose J = 1. This can only happen when i is odd and hence the color of c is 
black (because if i is even then the absence of white 4-cycles forces c to have 
the right color black for the canonical coloring). But then, since the three cells 
(z -I- 1, 0), (z -I- 1, 1) and (z -I- 2, 0) are white, d\ must be black, so that d\ is not 
indeed a spike. See Figure 8(f), where d\ is marked by o. This completes the 
case analysis and hence the proof of the lemma. 

Our theorem is a straightforward consequence of this lemma. Given an arbitrary 
WFC, we iteratively apply the lemma fixing cells one by one in the lexicographic 
order until we obtain the canonical WFC. The number of flips to be applied in 
order to fix one cell is at most n so that the total number of flips is at most 2rA . 
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Fig. 8. Situations in the proof of Lemma 1 



4 Concluding Remarks 

We have shown that the space of GFCs is connected when an edge is given to 
each pair of GFCs that are convertible to each other by one flip. In fact, we have 
shown that the diameter of the graph is at most n^. An obvious lower bound 
on the diameter is f2{n?) and it may not be very difficult to improve our upper 
bound into an asymptotically tight one. 

Our result guarantees that, when we apply a sequence of random flips 
to the canonical GFC, every GFC has a positive probability of being generated. 
Although we have no theoretical results on the probability distribution after some 
number of flips are applied, an experimental study suggests that the random walk 
in the space of GFCs mixes quite rapidly. A theoretical analysis of the rate of 
mixing is an important direction of the future research. 
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Abstract. We consider all planar oriented curves that have the follow- 
ing property depending on a fixed angle if>. For each point B on the 
curve, the rest of the curve lies inside a wedge of angle tp with apex 
in B. This property restrains the curve’s meandering, and for p < ^ 
this means that a point running along the curve always gets closer to all 
points on the remaining part. For all ip < tt, we provide an upper bound 
c{p) for the length of such a curve, divided by the distance between its 
endpoints, and prove this bound to be tight. A main step is in proving 
that the curve’s length cannot exceed the perimeter of its convex hull, 
divided by 1 -I- cos p. 

Keywords: Self-approaching curves, convex hull, detour, arc length. 



1 Introduction 

Let / be an oriented curve in the plane running from A to Z, and let ip be an 
angle in [0,7r). Suppose that, for every point B on /, the curve segment from 
B to Z is contained in a wedge of angle p with apex in B. Then the curve / 
is called p- self- approaching, generalizing the self-approaching curves introduced 
by Icking and Klein [5]. 

At the 1995 Dagstuhl Seminar on Computational Geometry, Seidel [2] posed 
the following open problems. Is there a constant, c{p), so that the length of 
every (p-self-approaching curve is at most c{p) times the distance between its 
endpoints? If so, how small can one prove c(p) to be? 

Both questions are answered in this paper. We provide, for each p in [0,7r), 
a constant c(jp) with the above property, and prove it minimal by constructing 
a (/3-self-approaching curve such that this factor c{p) is achieved. 

Self-approaching curves are interesting for different reasons. If a mobile robot 
wants to get to the kernel of an unknown star-shaped polygon and continuously 
follows the angular bisector of the innermost left and right reflex vertices that 
are visible, the resulting path is self-approaching for p = tt/ 2; see [5]. Since the 
value of c(7t/2) is known to be « 5.3331, as already shown in Icking et al. [6], one 
immediately obtains an upper bound for the competitive factor of the robot’s 

* This work was supported by the Deutsche Forschungsgemeinschaft, grant K1 655/8-2, 
and by the Spezialforschungsbereich Optimierung und Kontrolle. 
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strategy. Improving on this, Lee and Chwa [7] give a tight upper bound of tt + 1 
for this factor, and Lee et al. [8] present a different strategy that achieves a factor 
of 3.829, while a lower bound of 1.48 is shown by Lopez-Ortiz and Schuierer [9]. 

In the construction of spanners for euclidean graphs, one can proceed by 
recursively adding to the spanning structure a point from a cone of angle ip, 
resulting in a sequence pi,p 2 , ■ ■ ■ ,Pn such that for each index i, Pi+i, ■ ■ ■ ,Pn are 
contained in a cone of angle tp with apex pp, see Ruppert and Seidel [11] or Arya 
et al. [3]. (Note, however, that such a sequence of points does not necessarily 
define a t/s-self-approaching polygonal chain because the property may not hold 
for every point in the interior of an edge.) 

Finally, such properties of curves are interesting in their own right. For ex- 
ample, in the book by Croft et al. [4] on open problems in geometry, curves with 
increasing chords are defined by the property that for every four consecutive 
points A, B, C, D on the curve, B is closer to C than A to D. The open problem 
of how to bound the length of such curves divided by the distance between its 
endpoints has been solved by Rote [10]; he showed that the tight bound equals 
|7T. There is a nice connection to the curves studied in this paper. Namely, 
a curve has increasing chords if and only if it is 7r/2-self-approaching in both 
directions. 

In this paper we generalize the results of [6] to arbitrary angles (^ < tt. In 
Sect. 2 we prove, besides some elementary properties, the following fact. Let B, 
C, and D denote three consecutive points on a (p-self-approaching curve /. Then 

CD < BD — cos (p ■ length(/[R . . C]) 

holds, where CD denotes the euclidean distance between points C and D and 
length(/[R . . C]) denotes the length of / between B and C. This property 
accounts for the term “self-approaching”; in fact, for (/? < 7t/2 the factor cosf^ 
is not negative, so that CD < BD holds: The curve always gets closer to each 
point on its remaining part. Although this property does not hold for (^ > ^, 
we will nevertheless see that our analysis of the tight upper bound cipp) directly 
applies to this case, too. 

In Sect. 3 we show that the length of a (/9-self-approaching curve cannot 
exceed the perimeter of its convex hull, divided by 1 -I- cos(/9. This fact is the 
main tool in our analysis. It allows us to derive an upper bound for the curve’s 
length by circumscribing it with a simple, closed convex curve whose length can 
be easily computed, see Sect. 4. Finally, in Sect. 5, we demonstrate that the 
resulting bound is tight, by constructing (/j-self-approaching curves for which the 
upper bounds are achieved. 

The proofs of Lemma 3 and Theorem 1, which are omitted or only sketched 
here due to the limitation of space, can be found in the full paper [1]. 

2 Definitions and properties 

For two points B and C let r{B, C) denote the ray starting at B and passing 
through C. We simply write BC for the euclidean distance between B and C. 
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We consider plane curves /: [0 . . 1] — parameterized by a parameter that 
is usually denoted by t S [0 .. 1]. It is considered as an oriented curve, i.e., it 
has a specified direction from beginning to end. 

We do not make any assumptions about smoothness or rectifiability of the 
curve, although it will turn out that (/j-self-approaching curves are rectifiable. 
For a point X on the curve we write tx for the parameter value with f(tx) = X 
if no confusion arises. (We will see below in Lemma 1 that tx is in fact unique.) 

For two points B = f{tB) and C = f{tc) on / with ts < tc, we write 
f[B .. C] or fits ■■ tc] for the part of / between B and C. By f[B . .] we denote 
the part of / from B to the end. By length(/), length(/[R . . C]), etc., we mean 
the length of the curve or of an arc. 

Definition 1. A curve f is called (/j-self-approaching for 0 < </? < tt if for any 
point B on f there is a wedge of angle ip at point B which contains f[B . .]. In 
other words, for any three points B, C, D with ts < tc < tjo, the angle CBD, if 
it is defined, is at most ip. 

Let f be an oriented curve from A to Z . Then the quantity 

length(/[A . . Z]) 

AZ 

is called the detour of f. 

The detour of a curve is the reciprocal of the minimum qrowth rate used 
in [10]. 

We wish to bound the detour of f^-self-approaching curves. The first definition 
of f^-self-approaching curves also makes sense for ip > tt , but then any circular 
arc connecting two points A and Z is (/J-self-approaching, which means that the 
detour of such curves is not bounded. Therefore, we restrict our attention to 
the case ip < tt . Each 0-self-approaching curve is a line segment and its detour 
equals 1. 

Lemma 1. A ip -self- approaching curve does not go through any point twice. 

Proof. Suppose the curve visits point X twice. Shortly after the first visit of 
X there is a point on the curve for which the t/s-self-approaching property is 
violated. □ 

By this lemma, we can take any point Z on the curve as the origin of a polar 
coordinate system and use a continuous parameterization f(t) = (r{f),Tf(t)) for 
any part f[A..B]of the curve which comes before Z. For a point X = f{tx) 
on the curve we will use the notation X = {r{tx),f>{tx)) ='. (rx,'if>x)- 

Lemma 2. IftA <tB <tz and ip a = V’s (mod 27 t), then rB < va. 

Proof. The assumption > va would lead to an angle BAZ = tt, violating the 
(/3-self-approaching property. (The case = va is also excluded, by Lemma 1, 
but we do not use this fact.) □ 

The following lemma shows, roughly speaking, that a </?-self-approaching 
curve f[A . . Z] is enclosed by two oppositely winding logarithmic spirals 
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through A with pole Z, see Fig. 1. For (p > tt/2, this is true only locally, as 
long as the curve does not leave the vicinity of A. In polar coordinates {r,tjj), a 
(^-logarithmic spiral is the set of all points with 7- = gV’cotvj or r = 




Fig. 1. For 0 < (p < tt/2 the curve is enclosed by two congruent arcs of a 
(^-logarithmic spiral. 



Each ray through the origin intersects the spiral in the same angle ip. For 
p 7^ 7t/2, the length of an arc S'[A . . B] of a (/s-logarithmic spiral S with pole 
Z is given by {AZ — BZ). For p = tt/2, the spiral degenerates to a circle. 
The properties of Lemma 3 are used in Sect. 4 for the circumscription of a 
(^-self-approaching curve by a convex area whose perimeter is easy to determine. 

Lemma 3. Let tA<tB <tz- 

(a) rs < TA • . 

(b) rB < ta — cos p ■ length(/[A ■ ■ B]) . 

Note that Lemma 3 also applies to any subsection of the curve /. 

3 c^-self-approaching curves and the perimeter of their 
convex hull 

In this section we show that the length of a (/9-self-approaching curve is bounded 
by the perimeter of its convex hull, divided by 1 -|- cos p. 

Let conv(P) denote the convex hull of a point set P and peri(P) the length 
of the perimeter of conv(P). 

Theorem 1. For a p-self-approaching curve f with 0 < p < tt, 

(1 -I- cos(p) length(/) < peri(/) . 

The proof can only be sketched here. First, the claim is shown for a polygonal 
chain whose vertices lie on the curve /. This is done by induction on the number 
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of vertices. But the length of a curve / is, by definition, the supremum of the 
lengths of all polygonal chains which are obtained as polygonal subdivisions 
of /. Therefore, an upper bound for the length of all such chains is also an upper 
bound for the length of /. 



4 An upper bound for the detour 



Theorem 2. The length of a (p- self- approaching curve is not greater than c((p) 
times the distance of its endpoints, where 



c(p) := 



1 , 

2I3 + TT + 2 

max , 

/3G[o..f] v5 — 4cosp 

max h(ip,j3), 
0e[o..v] 



for (fi = 0, 
for (fi = 7t/2, 
otherwise, 



with h{(p, P) 



((1 + /cosp 

Y^((l -h -cos/3)2 -h sin^ /3 



The function c{ip) is strictly monotone and continuous for ip G [0, tt). It tends 
to infinity if p tends to tt, see Fig. 2 for its values up to = 1.8. 



c{p) 




Proof. For = 0 we have a straight line segment, and the theorem is obvious. 
So let us assume 0 < p < tt from now on. 

Let / be a <p-self-approaching curve from A to Z. W.l.o.g., we may assume 
that / does not cross the segment AZ . Otherwise we apply the bound cfp) 
successively to each subcurve between to successive curve points on AZ and add 
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up the results. Due to the self-approaching property the curve points on AZ 
appear in the same order as on /, so the overall bound is less than or equal 
to c{lp). 

Assume w. 1. o. g. that the curve starts by leaving A on the left side of the 
directed line AZ. Let AC be the right tangent from A to the curve, touching the 
curve in the point C. (The curve is on the left side of AC.) We select C as far 
as possible from A, and we denote the angle ACZ by j3, see Fig. 3. (For C = Z 
we set f3 = 0.) Note that P < (fi holds, otherwise there is a point Cq before C 
on / such that the angle CCqZ is also greater than (p which contradicts the 
self-approaching property at Cq. 




Fig. 3. A (/3-self-approaching curve must lie in this area. The angle P' shown 
here maximizes h{ip,P). 



Let B denote the first point of intersection of the curve with r{C,Z). (For 
C = Z we take B = A.) Since the curve neither crosses t(A, C) nor the segment 
AZ, Z must lie between B and C. We apply Lemma 3a to the arc f[B . . C], 
considering Z as the origin, and we get 

CZ < BZ ■ . 

The application of the lemma is justified because, by Lemma 2, C is the first 
point of intersection of the curve with r{Z,C). Applying the lemma to f[A . . B] 
with C as the origin, we get 

CB < 

All in all, since CB = CZ + BZ, we obtain 

CA>CZ-{1 + ^ 



( 1 ) 
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Now we select a point C on r(A, C) with C A > CA such that the equation 

C'A = C'Z ■ {I + cot V (-2) 

is fulfilled. Such a point C exists, since, as we move C further away from A, 
the ratio C A : C Z converges to 1, the angle j3' := /_AC Z decreases towards 0 
and hence also converges to 1, whereas 1 -I- is a constant bigger 

than 1. Therefore the inequality (1) changes direction as AC — > oo. We have 
P' < 13 < (f, and also C yf Z. 

In the following let B' be the point on r(C", Z) with B' Z = C Z ■ and 

LC ZB' = 7T, see Fig. 3. 

Lemma 4. A ip- self- approaching curve from A to Z is contained in the convex 
region bounded by the following three curves, see Fig. 3: 

(1) a if-logarithmic spiral from A to B' of polar angle P' with pole C ; 

(2) a if-logarithmic spiral from B' to C of polar angle tt with pole Z ; 

(3) the segment AC . 

Proof. Let B” be the first intersection point of / with r{C,Z). By Lemma 3a 
applied to the pole C, the arc f[A.. B”] lies inside the (^-logarithmic spiral 
S with pole C with polar angle P starting at A. The (^-logarithmic spiral S' 
with pole C through A is obtained from S by stretching it about the center A, 
and hence f[A.. B"] is also contained inside S' , and in part (1) of the region 
boundary. In particular, B" Z < B' Z . It follows with the help of Lemma 2 that, 
between the rays r{Z,A) and r{Z,B'), no point of the whole curve / can lie 
outside S' . 

Now let C" be the first intersection point of / with r{Z,C). By Lemma 3a 
applied to the pole Z, the arc f[B" . . C] lies inside the (p- logarithmic spiral 
T around pole Z with polar angle tt starting at B" . Since B" Z < B' Z, the arc 
lies inside the logarithmic spiral T' which forms part (2) of the region boundary. 
Again, it follows with the help of Lemma 2 that, below the line C'B' , no point 
of the whole curve / can lie outside T'. 

Finally, in the region between the rays ZC and ZA, the curve cannot escape 
across the segment AC , and the proof is complete. □ 

Now we can prove Theorem 2. We treat only the case p yf tt/ 2, where we 
have proper spirals. The case p = tt/ 2, where we have circular arcs, has already 
been treated in [6]. 

We choose a scale such that C'Z equals 1. Now AC" = 

B' Z = holds by construction. Therefore the lengths of the three curves 

in Lemma 4, using the formula for the arc length of a (^-logarithmic spiral, are 
given by 

Li := -^(AC - B'C) = f(l -f - (1 + 

COS(^ COSip V / 

L2 := -^{B'Z - C'Z) = - 1) 

COS p cos p 

La := AC = (i + , 



324 



Oswin Aichliolzer et al. 



and for the distance of the endpoints of / we have 

AZ = \j ((1 + — cos /3')^ + sin^ /?' . 

Altogether we conclude from Theorem 1 



length(/) ^ Li + L 2 + ^3 1 



AZ AZ 1 + cos Lp 

The right term can be transformed to 

^4 _|_ g-ir cot 95^g/3' cot (p 2_ 



1 + COS 9? 



cos ^p \| ((1 + — cos /3')^ + sin^ (3' 



□ 



5 Tightness of the upper bound 

Theorem 3. The upper bound c((p) given in Theorem 2 for the detour ofip-self- 
approaehing curves is tight. 

Proof. We construct a (/3-self-approaching curve / from A to Z as in the proof 
of Lemma 4, the construction is shown in Fig. 4. The curve starts with two 



A 




Fig. 4. The construction of the (/>-self-approaching curve with maximal detour 
for (p=2. 



logarithmic spirals similar to part (1) and part (2) in the proof of Lemma 4, 
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except that part (2) is split into two parts. The segment C Z of length 1 is 
replaced by a t/s-self-approaching zigzag curve of length Lg = from C 




Fig. 5. A sequence of cycloid parts forming a t/s-self-approaching curve inside a 
thin rectangle. 



to Z, which moves inside a thin rectangle along the segment C' Z. This last 
part of the curve between Z and C', see Fig. 5, is obtained by “stacking” small 
cycloids {x, y) = (r(a — sin a), r(l — cos a)), for 0 < a < 2ip. One piece of cycloid 
has “height” ff = r(l — cos2(/3) = Srsin^ ipcos^ ip and length L = Srsin^ We 
must choose the size parameter r in such a way that 1 / H is an even integer n; 
then the curve will precisely connect the points C and Z. The length of the curve 
is then Lj H = which is what we need. The width of the construction is 

W = r{2ip — sm 2(f). We arrange the cycloid pieces on the left side of the segment 
C'Z, as indicated in Fig. 5; then the curve is contained in a rectangle C'ZZC 
of width W = ZZ. The long side ZC' slightly extends the segment ZC by the 
amount 2r — H . 

Now we can describe the whole construction of the curve, in reverse direction. 
The curve consists of the following parts (numbered in accordance with Lemma 4, 
using the notation from there): 

(3') The curve ends with the t/s-self-approaching curve from C to Z described 
above, whose length is 

(2) before, there is a (/?- logarithmic spiral of polar angle 7t/2 with pole Z 
connecting C to some point B\. 

(2') then, a (^-logarithmic spiral of polar angle 7 t/ 2 with pole Z connecting 
Bi to B] 

(!') the curve starts with a (/?- logarithmic spiral of polar angle /3 with pole C 
between a point A and B, where P < is the value for which the 
maximum in the definition of c{ip) is attained. 
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It can be checked that the parts which are logarithmic spirals are always (/s-self- 
approaching, as the line from any point X to the current pole contains the whole 
curve on one side. So obviously the whole curve is (p-self-approaching. Since r 
can be made as small as we like, we have C — > C', Z ^ Z, A ^ A, B ^ B' 
as r — !■ 0, and the logarithmic spirals that we use will approach the “ideal” 
logarithmic spirals that appear in Lemma 4, see Fig. 3. This means that 

length(/) Li + L 2 + (1 + - 2 + ■ cos^ 

cos + e’^ ‘^)e^ — cos /3)^ + sin^ (3 

which equals c(j~p). □ 

6 Conclusions 

Here we analyze the maximum length of curves with an upper bound on the 
angular wedge at A. This condition is not symmetric since it distinguishes the 
source and the target of the curve. One might also consider a symmetric situa- 
tion, where curves are (/J-self-approaching in both directions. 

Generalizations to three dimensions are also completely open. 
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